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The idea that events are equipped with a partial causal order is central to our understanding of physics in the 
tested regimes: given two pointlike events A and B, either A is in the causal past of B, B is in the causal past 
of A, or A and B are space-like separated. Operationally, the meaning of these order relations corresponds to 
constraints on the possible correlations between experiments performed in the vicinities of the respective events: 
if A is in the causal past of B, an experimenter at A could signal to an experimenter at B but not the other way 
around, while if A and B are space-like separated, no signaling is possible in either direction. In the context 
of a concrete physical theory, the correlations compatible with a given causal configuration may obey further 
constraints. For instance, space-like correlations in quantum mechanics arise from local measurements on joint 
quantum states, while time-like correlations are established via quantum channels. Similarly to other variables, 
however, the causal order of a set of events could be random, and little is understood about the constraints that 
causality implies in this case. A main difficulty concerns the fact that the order of events can now generally 
depend on the operations performed at the locations of these events, since, for instance, an operation at A could 
influence the order in which B and C occur in A’s future. So far, no formal theory of causality compatible with 
such dynamical causal order has been developed. Apart from being of fundamental interest in the context of in¬ 
ferring causal relations, such a theory is imperative for understanding recent suggestions that the causal order of 
events in quantum mechanics can be indefinite. Here, we develop such a theory in the general multipartite case. 
Starting from a background-independent definition of causality, we derive an iteratively formulated canonical 
decomposition of multipartite causal correlations. For a fixed number of settings and outcomes for each party, 
these correlations form a polytope whose facets define causal inequalities. The case of quantum correlations in 
this paradigm is captured by the process matrix formalism. We investigate the link between causality and the 
closely related notion of causal separability of quantum processes, which we here define rigorously in analogy 
with the link between Bell locality and separability of quantum states. We show that causality and causal sepa¬ 
rability are not equivalent in general by giving an example of a physically admissible tripartite quantum process 
that is causal but not causally separable. We also show that there are causally separable quantum processes that 
become non-causal if extended by supplying the parties with entangled ancillas. This motivates the concepts of 
extensibly causal and extensibly causally separable (ECS) processes, for which the respective property remains 
invariant under extension. We characterize the class of ECS quantum processes in the tripartite case via simple 
conditions on the form of the process matrix. We show that the processes realizable by classically controlled 
quantum circuits are ECS and conjecture that the reverse also holds. 


I. INTRODUCTION 

The possibility for dynamical and indefinite causal structures in quantum theory and more general probabilistic theories has 
recently attracted a great deal of interest, both from a foundational point of view and in the context of quantum information 
processing [1-23]. Motivated by the long standing search for a theory of quantum gravity, where the causal structure is expected 
to be dynamical as in General Relativity but fundamentally probabilistic in nature, as well as by the exploration of novel quantum 
architectures beyond the standard circuit model, operational ways of thinking about causal order in a probabilistic setting have 
provided new perspectives on quantum mechanics, its possible applications, and routes for potential extensions. 

A general framework for the study of correlations between local experiments without the assumption of a predefined causal 
order between them was proposed in Ref. [4]. In this so called process framework, each experiment is associated with an 
input and an output system between which an experimenter can perform different operations, but no specific assumption about 
the existence of a causal structure in which the experiments are embedded is made. When the experiments take place at fixed 
locations in a background space-time in circumstances defined without post-selection, the causal structure of space-time imposes 
signaling constraints on the correlations between the experiments. For example, there can be signaling from one experiment to 
another only if the former takes place in the past light cone of the latter, but no signaling between space-like separated locations 
or from the future to the past is possible. In Ref. [4], it was shown that if the local operations are described by quantum 
mechanics, it is possible to conceive correlations that are incompatible with any underlying causal structure. Such correlations 
allow two parties, Alice and Bob, to establish correlations that violate a causal inequality, which is impossible if their operations 
take place in a causal order, even if that order is random. A similar possibility was subsequently shown to exist in a multipartite 
setting even when the local operations are purely classical [9], which in the bipartite case is not possible [4]. It is not known at 
present whether such joint processes could have a physical realization without post-selection, that is, whether one could prepare 
a setup that leads to correlations violating causal inequalities between separate experimenters who locally experience the validity 
of standard quantum mechanics. 
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Another peculiar effect that seems at odds with causality, which has a physical realization without post-selection, arises when 
local quantum operations are applied in an order that depends on the value of a variable prepared in a quantum superposition 
[3, 5, 6, 10, 17], a technique known as ‘quantum switch’ [3]. This approach allows achieving certain tasks that are impossible 
if the quantum operations are applied in a definite causal order. In contrast to the violation of a causal inequality, however, this 
conclusion depends on the assumed description of the local operations and is theory-dependent. 

So far, the analysis of these effects has relied on semi-rigorous considerations about what it means for a process to be compati¬ 
ble with ‘definite causal order’. A fully rigorous argument requires such considerations to be rooted in a clear notion of causality, 
which, however, in this background-independent setting has been lacking. Such a notion is expected to have a universal expres¬ 
sion which can be applied in the context of any number of parties, but how to formulate it turns out to be a nontrivial problem. 
Simple considerations in the multipartite case show that the causal order of a set of local experiments should most generally be 
considered to be a random variable that can depend on the settings of these experiments. The latter possibility cannot be excluded 
since compatibly with our intuition of causality we can conceive of scenarios in which the setting in a given local experiment 
can influence the order in which other experiments take place in the future. In other words, causality should be expressed as a 
rule that constrains the joint conditional probabilities for the events in the local experiments and the causal order between them, 
allowing for the possibility that causal configurations unfold as a result of events in the past. A formal theory of such dynamical 
causal order is essential not only for understanding the subject of indefinite causal order in quantum mechanics or more general 
theories, but also for the problem of inferring causal structure beyond the classic paradigm of underlying deterministic variables 
and static causal relations [24]. 

In this paper, we develop rigorous theory-independent and theory-dependent notions of causality in the process framework 
and characterize the structure and relations between the corresponding classes of processes they define. Section 11 is devoted to 
the theory-independent perspective, which contains our core result. We formalize the process framework in theory-independent 
terms and propose a definition of causality which allows for the possibility of dynamical causal order. We develop a number of 
concepts, such as multipartite signaling, reduced and conditional processes, and derive necessary and sufficient conditions for 
a process to be causal, which are expressed in the form of an iteratively defined canonical decomposition of the probabilities 
in the process. This decomposition can be understood as describing a causal ‘unraveling’ of the events in the experiment in a 
sequence, showing that the proposed notion of causality yields the structure expected from intuition. Apart from being logically 
non-trivial, this result has important conceptual implications - it presents us with an understanding of causal order as a random 
function on random events rather than the ordering of underlying locations in which events happen. This perspective is in the 
spirit of the idea of background independence in general relativity, according to which there are no underlying locations, but 
only events and the relations between them. In Section III, we focus on the quantum process framework, where we develop 
different theory-dependent notions of causality, which in principle have analogues in more general process theories too. Specifi¬ 
cally, we investigate several possible generalizations of the bipartite notion of causal separability, which was previously defined 
heuristically in the bipartite case by postulating a particular form of the quantum process matrix [4]. We show that this form 
can be understood as arising from the canonical decomposition of causal processes under the condition that each process in this 
decomposition is a valid quantum process. We define the multipartite concept based on this principle. We show that the sets of 
causal and causally separable processes are not equivalent in the multipartite case, by giving an explicit example of a class of 
processes that are causal but not causally separable. This example is based on the ‘quantum switch’ technique discussed earlier. 
We also show that, surprisingly, there exist causally separable (and hence causal) quantum processes that become non-causal if 
extended by supplying the parties with an entangled input ancilla. This example of ‘activation of non-causality’ is constructed 
based on a suitable modification of the non-causal process matrix of Ref. [4]. This observation motivates the concepts of ex- 
tensibly causal and extensibly causally separable (ECS) processes, for which the respective property remains invariant under 
extension with arbitrary input ancillas. We derive a characterization of the class of ECS quantum processes in the tripartite 
case in terms of simple conditions on the form of the process matrix, which generalize the known form of bipartite causally 
separable process matrices. In the bipartite case, causal separability and extensible causal separability are equivalent, hence the 
class of ECS processes can be regarded as another possible multipartite generalization of the previously known bipartite concept. 
Einally, we consider the class of processes realizable by classically controlled quantum circuits, which we show is inside the 
class of ECS processes. These, too, are equivalent to the causally separable processes in the bipartite case and provide a possible 
multipartite generalization based on a different principle. We conjecture that the processes that can be obtained by classically 
controlled quantum circuits are equivalent to the ECS processes, and hence are described by process matrices obeying the simple 
conditions we have derived. We provide arguments in favor of this conjecture based on analysis in the tripartite case. In Section 
IV, we summarize our results and discuss future research directions. 
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II. THE PROCESS ERAMEWORK 
A. General processes 

The process framework introduced in Ref. [4] describes probabilities for the outcomes of local experiments associated with 
different parties, Alice, Bob, Charlie, etc., performed in abstract circumstances defined without assuming the existence of a 
global causal order between the experiments, but only a local order of the events in each of them. Each local experiment can 
be thought of as performed inside an isolated laboratory, where, at a given instant, an input system is received in the laboratory, 
it is subject to some operation that yields one of a set of possible outcomes, and, at a given later instant, an output system 
is sent out of the laboratory. The input and output systems are assumed to provide the only means of information exchange 
between events in the laboratory and any events in the rest of the whole experiment. The framework in Ref. [4] was developed 
for the case where the local experiments are described by standard quantum mechanics, under a set of specific assumptions. 
These assumptions are that the joint probabilities of the outcomes of the local experiments are non-contextual functions of the 
transformations (described by completely positive (CP) maps) associated with the local outcomes, and that the local experiments 
can be extended to act on ancillas prepared in any joint quantum state. 

There is a straightforward way in which an analogous theory can be formulated starting from any generalized operational 
probabilistic theory that has a formulation in the circuit framework [25-28] following the construction in Ref. [4]. Indeed, the 
concepts of transformation and state are defined for any such theory, and so is the idea of a composite system that is employed in 
the notion of adding an ancilla. [Note that the representation of the quantum process framework [4] in terms of process matrices 
(see Section III) is built around the Choi-Jamiolkowski isomorphism [29, 30], which may not be available for arbitrary theories, 
but this concerns the representation of the framework.] However, the above assumptions underlying the extension from a circuit 
theory to a process theory, albeit arguably natural, are by no means mandatory. For example, one can conceive of extensions of 
quantum theory in which the joint probability distributions are contextual, but nevertheless for each single party the marginal 
probabilities are non-contextual and consistent with standard quantum mechanics. One can also conceive of theories in which 
the allowed non-signaling ancillary resources are not quantum states, although they give valid non-contextual probabilities for 
the outcomes of any combination of local quantum measurements [32]. It is therefore of interest to formulate a general process 
framework in operational terms without additional assumptions about how that framework may be related to theories expressed 
in the circuit framework. This is important also for the question of understanding the concept of causal inequality introduced in 
Ref [4], which tests the compatibility of a process with an underlying causal structure in theory-independent terms. 

To this end, we will describe each local experiment, say that of Alice, by two variables - a setting s^, and an outcome of for 
that setting. What these variables are supposed to correspond to in practice will be discussed below. The possible settings for 
a given local experiment are assumed to belong to some set 5*^, and the outcomes for each value s^ of the setting to a set Of. 
Since we can formally extend the possible outcomes for each setting with fictitious outcomes that never occur, without loss of 
generality we can assume that the sets Of are identical for all 6 S^, i.e.. Of = O^. A particular event in Alice’s laboratory is 
thus described by a pair of variables (s'^, o^) e S'^xO^. An operation is a collection of possible events ((s'^, o'^)](/£cr' for a fixed 
value of s'^ 6 S^. The very occurrence of the local experiments, as well as the circumstances in which they take place, would 
be conditional on some variable that we will denote by which belongs to some set ® ' of possible such variables. 

What the variable vtA ®’*'’ ' is supposed to correspond to in practice will also be discussed below. 


Definition II.l. (Process): Mathematically, we define a process AV'^ ®’ ■ for a set of local experiments (or patties) S = (A, B, • ■ ■ ] 
as the collection of conditional probabilities 


= jB(o^,o®,...|/,/,...,v/'*'-)), 
o^ eO\s^ eS\ XeS, 


( 1 ) 


for a given value of w'^ ®’ ' e 


Definition II.2. (Trivial process): For the purposes of expressing more succinctly certain conditions later, it is convenient to 
allow the set of local experiments S - (A,B, • • ■ ] in the definition to be the empty set (] as a special case. In that case, the 
corresponding process will be referred to as the trivial process. We define it to consist of a single probability - that for the trivial 
outcome given the trivial setting - which is equal to 1. 

In abstract terms, a theory in the process framework is specified by listing the different types of input and output systems, 
all possible settings and outcomes between input and output systems of specific types, all possible variables vt/' ®’ ' for which 
we have a valid occurrence of a set of local experiments S - (A,B, • • ■ ], and the corresponding processes (1). Similarly to 
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operational probabilistic theories in the circuit framework [25-28], it is understood that equivalence classes of the variables s^, 
o^, and , with regard to the probabilities (1) are taken, and these variables are identified with their equivalence classes. 

But what are these variables supposed to describe in practice? In Refs. [15, 16], it was argued that there are two main ideas that 
underlie the concept of operation in the standard circuit framework for operational probabilistic theories [25-28]. The first one, 
termed the closed-box assumption, is the idea that the input and output systems of an operation are the only means of information 
exchange responsible for the correlations between the outcomes of that operation and the outcomes of other operations in the 
global experiment. The second idea, termed the no-post-selection criterion, which makes sense assuming a predefined notion of 
temporal ordering as in the standard circuit formulation, is that the variable that defines an operation, or the setting s^, can be 
known with certainty before the time of interaction with the input system unconditionally on any events in the future. 

Since no predefined global time is assumed in our picture, the latter condition will be imagined to hold only with respect to 
the local temporal sequence of events observed by each experimenter. Furthermore, we will assume that the variable that 

defines the global setup in which the individual experiments take place is also obtained without post-selection. We can make 
sense of this idea by imagining that the variable is associated with an event that fits within each of the local temporal frames of 
the experimenters and is such that it occurs before any of them receives the input system. We will call processes that describe 
experiments of this kind pre-selected processes. (For a generalization that admits post-selection, see Ref. [16]). 

For the rest of this paper, we will consider only pre-selected processes. We will drop the explicit specification ‘pre-selected’ 
for brevity, and will refer to them simply as processes, unless we want to explicitly emphasize the assumption of pre-selection. 
We will also drop the explicit specification of the variable on which the joint experiment is conditioned, and we will 

simply write = {p(o'^,o^, s ^,...)], keeping in mind that every process describes circumstances defined by such a 

variable and hence all probabilities we consider are implicitly conditional on such a variable. 


B. Causal processes 

In the circuit framework for operational probabilistic theories, causality is defined as the property that the probability distri¬ 
bution over the outcomes of a given operation in a circuit do not depend on what operations take place in the absolute future or 
absolute elsewhere [33] of that operation as defined by the strict partial order (SPO) of the circuit composition [26, 27]. More 
specifically, every circuit describes a set of operations taking place at the vertices of a directed acyclic graph, whose directed 
edges (the circuit ‘wires’) correspond to systems that go from one operation to another. Such a graph defines a SPO on the op¬ 
erations in a circuit (a precise definition of SPO is given below) - one operation is in the absolute past of another (equivalently, 
the latter is in the absolute future of the former) if there exists a directed path from the former to the latter through the graph. 
If there is no directed path connecting two operations, we say that one is the absolute elsewhere of the other. If we imagine 
that there is a local experiment taking place at every vertex of such a graph, the property of causality says that the probabilities 
for the outcomes of local experiments that are in the causal past or causal elsewhere of a given local experiment cannot depend 
on the setting of that experiment. A circuit theory that obeys this condition, such as standard quantum theory, is called causal, 
and for such a theory the SPO defined by the circuit composition can be interpreted as causal order [26, 27]. This interpretation 
corresponds to the intuitive idea that, if the setting of a local experiment is regarded as up to the ‘free choice’ of an experimenter, 
then any correlations between that setting and other variables must indicate a causal influence of the setting on those variables. 
From this perspective, causality can be understood as the condition that a variable can influence only variables in its immediate 
location or in its absolute future. 

In the process framework, we do not assume the existence of a given circuit in which the local experiments are embedded. 
Thus, there is no natural SPO with respect to which to define causality. Nevertheless, we may ask whether the probabilities 
described by a given process are compatible with the existence of a SPO with respect to which causality is satisfied. How 
to formulate this precisely, however, is not immediately clear because the process framework can describe situations in which 
the SPO may be random. For instance, it can describe the correlations between local experiments that can be embedded in 
different circuits according to some probability distribution. Clearly, if the SPO between the local experiments is random, it 
must be the case that conditionally on that SPO taking any particular value, the probabilities of the outcomes of the parties 
given their settings must obey the above notion of causality. This condition, however, is not sufficient to capture the idea of 
causality. For example, consider the local experiments of two parties, Alice and Bob, which are embedded at random in one of 
two possible causal circuits where they occur in different orders. The probabilities for all events and the specific circuit could be 
such that, conditionally on any particular circuit being realized, the joint probabilities of the outcomes of the parties given their 
settings obey the above notion of causality, but nevertheless the setting of Alice could be correlated with the circuit in which 
her experiment is embedded, and thereby with the SPO on the two local experiments. Intuitively, such a situation should be in 
conflict with causality, because if Alice’s setting could not influence events that occur in the past, it should not influence whether 
or not Bob performs an operation in the past. The circuit notion of causality cannot be used to define such an independence from 
the past, because there the past is defined assuming a fixed circuit. This indicates that we need a more general notion of causality 
that imposes constraints on how the SPO on the local experiments can depend on the parties’ settings. A simple possibility 
is to require that the SPO on the local experiments must be independent of the parties’ setting. This condition, however, is 
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too restrictive, because, compatibly with the idea of causality, we can conceive of scenarios where the setting of a given party 
influences the order in which other parties perform their experiments in that party’s absolute future. Thus, a more sophisticated 
definition of causality is needed for the process framework. We next develop such a definition. 

First, let us review the properties of SPO and introduce some terminology. A SPO on a nonempty set of local elements 
S - {A, B, C, ■ ■ ■) is a binary relation < which satisfies the following conditions: (1) irreflexivity - not A<A; (2) transitivity 
- if A<B and B<C, then A<C; (3) anti-symmetry - if A<B, then not B<A. When two local experiments A and B satisfy A<B 
(equivalently, B>A), we will say that A is in the absolute past of B, or that B is in the absolute future of A [33]. It will be 
convenient to introduce the notation A^B (equivalently, B^A), which means Ai^B and not A<B, that is, A and B are different and 
A is not in the absolute past of B (equivalently, B is not in the absolute future of A). We will also introduce the notation A:^^B, 
which means AjfB and A^B, that is, A and B are different and A is neither in the absolute past nor in the absolute future of B 
(and hence, B is neither in the absolute past nor in the absolute future of A). In the case when A^^B, we will say that A and 
B are absolutely independent, or that A is in the absolute elsewhere [33] of B (and similarly, B is in the absolute elsewhere of 
A). A prototypical example of these relations is the causal order between the points in a Minkowski space-time - the absolute 
past/future of a given point corresponds to the points in the past/future light-cone of this point, excluding the point itself, while 
the absolute elsewhere consists of the points that are space-like separated from the point. 

Note that if a set of elements S - [A, B, ■ ■ ■ ] is equipped with a SPO, the elements X and Y in any pair (X, F) 6 x are 
related by X<Y, X>Y, Xi^Y, or X-Y. The SPO on the set S - (A,B, ■ ■ ■ ] is equivalently described by the list of respective 
relations for each such pair, which we will denote by k(A, B, ■ ■ ■). (This list obviously must respect the properties of SPO listed 
above.) Since for pairs iX,X) of identical elements this relation is trivially X = X, when we explicitly describe k{A, B, • ■ •), we 
will only list the pairwise relations for all pairs of distinct elements of the set (if any). Note that this description is generally 
redundant due to the transitivity of SPO. If we are given the pairwise relations for a set = (A, B, • ■ ■ ], we have, in particular, 
pairwise relations for any nonempty subset S' = {X,Y,---} c S, i.e., a SPO k(A, B, ■ ■ ■) on implies a SPO k(X, T, ■ ■ ■) on 
*S' c *S, *S' ^ (]. 

As discussed above, the SPO k(A, B, ■ ■ ■) on a set of local experiments S - (A, B, ■ ■ ■ ] in terms of which causality would 
be defined can most generally be random and correlated with the events in these experiments. The notion of causality would 
impose constraints on the possible correlations. We want these constraints to formalize the following intuition about causality: 

The choice of setting in a local experiment cannot affect the occurrence of events in the absolute past or absolute elsewhere 
of that experiment, nor the SPO on such events and the experiment in question. 

Since a process is defined by the conditional probabilities for the outcomes of the local experiments given their settings and 
does not assume the existence of probabilities for the settings, we will formulate the above constraint at the level of probabilities 
conditional on the settings. We define this as follows. 

Definition II.3. (Causal process): A process = {p{o'^,o^, ■ ■ ■ «*,■■■)] for a nonempty set of local experiments 

S - |A,B, ■■■] is called causal if and only if there exists a probability distribution p{k{A,B, - ■ ■),o^,o^ ■ ■\s^,s^ ,■ ■ ■), 
) piK{A, B, ■ ■ ■), o^, o®, ■ ■ ■ |s^, s*, ■ ■ ■) = p{o^, o^, ■ ■ ■ |s'^, s^, - ■ ■), where the random variable k{A, B, ■ ■ ■) takes values 
in the possible SPOs on .S = (A, B, ■ ■ ■ ], such that for every local experiment, e.g. A, every subset X - (3f, F, ■ ■ ■ ] of the rest of 
the local experiments, and every SPO k{A, X,Y,---) = k{A, X) on the local experiment in question and that subset, we have 


p(K(A,X),AiX,o^\s^,s^,---)^ p(K(A,X),AiX,o'^\s‘^,---). (2) 

Here, o"^ denotes collectively the outcomes of all local expriments in X, and A^X denotes the condition that all these local 
experiments are in the causal past or causal elsewhere of A (i.e., A f X,A f F, • • •, for all X,Y,--- e X). [The probability 
p{K{A,X),AiX,o^\s'^,s^,- ■ ■) is understood obtained from p{k{A, B, - ■ ■),o^,o^, - ■ ■ Is'^, s^, ■ ■ ■) by summing over all cases in 
which k(A, B, • • ■) is compatible with k(A, X) and A:^X (obviously, if k(A, X) itself is not compatible with A^X, the respective 
probability is zero) and over all possible outcomes of the local experiments in the complement of X]. 

Remark. A monopartite process is trivially causal. 

For a process that is causal, the binary relation < of the SPO k{A, B, ■ ■ ■) can be interpreted as causal order. In 

that case, we will use the terms ‘causal pasf, ‘causal future', ‘causal elsewhere' and ‘causally independent' in the place of 
‘absolute past’, ‘absolute future’, ‘absolute elsewhere’ and ‘absolutely independent’, respectively. We will also refer to the list 
of pairwise relations k{A, B, • ■ •) as the causal configuration of the local experiments (in the case of a monopartite process, the 
causal configuration is trivial). 

Our goal next is to understand the structure of causal processes that arises from this definition and show that it corresponds 
exactly to what one expects from intuition. 
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C. Fixed-order causal processes, (no) signaling, reduced and conditional processes 

Before we consider the case of general causal processes, it will be instructive to investigate the special case of causal processes 
for which the causal conhguration of the local experiments is fixed. As we will show, the constraints on such processes can be ex¬ 
pressed via the concept of signaling, which we develop below. We also introduce several related concepts that will be of use later. 

Definition II.4. (Fixed-order causal process): A process " = {p(o^, o^, ■ ■ ■ |s'^, s*, ■ ■ ■)) is called fixed-order causal if 
it is compatible with a deterministic causal configuration, i.e., if it satisfies condition (2) for a SPO k{A, B,- ■ ■) that takes a 
particular value k{A, B,- ■ - /c»(A, B,- ■ ■) with unit probability for all possible settings of the parties: 


P(k(A,B, ■ ■ ■),o^,o^, ■ ■ ■ |s'^, ■ ■ ■) = 0, 

iff k(A,B,---)* k,(A,B,---), 

- ,Vo'^ 6 6 . 


( 3 ) 


Since our definition of causal process implies that the setting of a local experiment cannot be correlated with the outcomes 
of local experiments that are in the absolute past or absolute elsewhere of that experiment, one may expect that for any fixed 
causal configuration of the local experiments, causality would impose constraints on the possibility for signaling between them, 
similarly to the case in the circuit framework. In the case of two experiments, signaling can be defined as follows: 


Definition II.5. (Bipartite signaling): We say that there is no signaling from Alice (A) to Bob (B) in a bipartite process 
if and only if the probabilities of the process satisfy 


p{o%^ /) = p(o^, o%^ /) = p(o%\ (4) 

i.e., the marginal probabilities for the outcomes of Bob are independent of the setting of Alice for any possible setting of Bob. 
Equivalently, we say that there is signaling from Alice to Bob in the process if and only if this condition is not satisfied. 

For a fixed-order causal process AV^’^, where one of the relations A<B, B<A, or A^^B holds with unit probability for all 
settings of the parties, we can see that signaling is possible from one experiment to the other only if the former is in the causal past 
of the latter, which agrees with the notion of causality in the circuit framework [26, 27]. Indeed, assume for example that B<A, 
i.e., p{k{A, B) - B<A\s‘^, s^) = 1, 'is^ e 5"^, e (and hence p{k(A, B) - A<B\s'^, s^) - 0 and p(k(A, B) - Ai'^B\s^, s^) - 0, 
Ss^ 6 S^, 'is’^ e 5*). Then, we have 

p(o^|/,/) =p(A<B,o*|/,i^) -H p(B<A,o*|/,/) -H p{AMB,o’^\s^,s’^) 

- p(AiB,o^\s^,s^) - p(AiB,o^\s^) - p{o^\s^), (5) 

V/ e S^^s’^ eS’^,o^ e O’^, 

i.e., there is no signaling from Alice to Bob. In a similar way, we see that if A<B, there is no signaling from Bob to Alice, while 
if A^^B, there is no signaling from Alice to Bob and no signaling from Bob to Alice. 

In the case of more than two local experiments, the relevant generalization of the above notion of signaling may not be 
immediately obvious. Notice that if a given bipartite process AV^’^ involves no signaling between A and B, such a process is 
in principle compatible with the causal configuration A^^B (in fact, it is compatible with any causal configuration of the two 
parties). However, in the case of processes for more than two local experiments, even if there is lack of signaling between any 
pair of experiments for all possible settings of the rest of the experiments, the process may not be compatible with a causal 
configuration in which all experiments are causally independent. 

To see this, consider three local experiments performed by Alice, Bob, and Charlie, where each party’s input and output 
systems are classical bits, and each party is allowed to perform any classical stochastic operation from the input bit to the output 
bit. Let the experiments of Bob and Charlie be causally independent, and let Alice’s experiment be in the absolute future of 
Bob’s experiment, but in the absolute elsewhere of Charlie’s experiment (i.e., the causal configuration of the three parties is 
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FIG. 1: Certain types of multipatite signaling correlations do not involve bipartite signaling and do not imply the existence of a 
causal connection between any particular pairs of channels. The example discussed in the text could arise from any of the 

mechanisms sketched here. 


[B<A, BjtjtC]). Imagine that Charlie receives his input system in one of the two possible states 0 or 1 with probability 

1/2, and depending on that state, Alice and Bob are in one of the following two scenarios. In the first scenario (say, when 
Charlie receives 0), Bob receives a random bit as an input system, his output bit is sent unaltered into the input system of Alice, 
and Alice’s output bit is discarded. In the second scenario (when Charlie receives 1), Bob again receives a random input bit, but 
this time his output bit is flipped before sending it into Alice’s input, and Alice’s output bit is again discarded. In both cases, 
the output system of Charlie is discarded. Clearly, the described situation can be realized in agreement with a fixed causal 
configuration of the parties - all we need to do is supply Bob with a random bit and correlate the channel from Bob to Alice 
with the input system of Charlie, discarding the outcomes of Alice and Charlie. The mechanism realizing this is sketched in 
Fig. la. Note that the tripartite process corresponding to this scenario would involve no signaling from Bob to Alice in spite of 
the existence of a channel from Bob to Alice. This is the case irrespectively of what operation Charlie performs. Obviously, 
there can be no signaling from Alice to Bob either, since Alice operates in the future of Bob, nor can there be signaling between 
Alice and Charlie, or between Bob and Charlie, since Charlie is causally independent of both Alice and Bob. Thus, we have 
no signaling between any pair of parties, no matter what the setting of the third party is. Yet, the possible correlations between 
the parties cannot be realized if all parties are causally independent because if Alice and Charlie measure their input bits and 
collect the results of their measurements, they can infer the bit sent out by Bob, which is impossible if all parties are causally 
independent. We might say that in this case we have signaling from Bob to Alice and Charlie together. But intuitively, given the 
described scenario, this signaling should be from Bob to Alice only, since there is no channel connecting Bob’s output system to 
Charlie’s input. However, the latter conclusion is based on knowledge about the mechanism by means of which the correlations 
are established, or about the causal configuration of the parties, and does not follow solely from the correlations between them. 
Indeed, the tripartite joint probabilities for the outlined scenario are symmetric with respect to interchanging the roles of Alice 
and Charlie, and thus they could arise from a different mechanism in a situation where Alice is causally independent of both 
Bob and Charlie, and Charlie is in the causal future of Bob (Fig. lb). They could also arise from a channel from Bob to both 
Alice and Charlie (Fig. Ic) which transforms Bob’s output bit into either correlated or anti-correlated random input bits for 
Alice and Charlie. We therefore see that, at the level of the joint probabilities for the parties’ experiments, there is no way 
of distinguishing between these different mechanism of information transmission, and hence no way of giving a definition 
of signaling among a proper subset of the parties that unambiguously captures the existence of such a mechanism. We can, 
however, give an unambiguous definition of lack of signaling between two complementary subsets of the parties (Fig. 2), as 
well as an associated notion of multipartite signaling, generalizing the bipartite case. 


Definition II.6. (Multipartite signaling): Consider an n-partite process " for a set of local experiments S = {1, • • ■ ,n), 
n = 0,1, ■ ■ ■ (in the case of n = 0, this is understood as the empty set, and correspondingly the process is the trivial process). Let 
^ = {1, • • ■ ,k} and S - {k + I, - ■ ■ ,n},Q < k < n,he two complementary subsets of the experiments, ^U!B = S,^r\!B-{} 
(for simplicity, we take them to be the first k and the next n - k experiments, which can always be ensured by relabeling). We 
say that there is no signaling from the subset JTI to the complementary subset S in the process •" if and only if 
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- ,s") = - ,s"), (6) 

6 S^,o^ 6 O-', J = 1, • • • ,n. 

Equivalently, we say that there is signaling from (1 or ■ ■ ■ or A:) to (A: + 1 or ■ ■ ■ or n) if and only if this condition is not satished. 



FIG. 2: Pictorial representation of the dehnition of multipartite signaling. 

Remark. There is no signaling from or to the empty subset. 

Note that this dehnition only says whether there is signaling from one or more local experiments from a given subset to one 
or more local experiments from the complementary subset, but in the general case it does not identify pairs of experiments 
between which there is signaling. In the case of two experiments, the dehnition reduces to the notion of bipartite signaling 
dehned earlier. 

Definition II.7. (Non-signaling process): A process AV'■ for a set of local experiments S - {1, • • • ,n), n = 0,1, • • •, is 

called non-signaling if and only if there is no signaling from to S for any pair of complementary subsets ^ and S of S. 

Remark. Monopartite processes and the trivial process are non-signaling. 

From the dehnition of causal process, one easily obtains the following relation between the existence of multipartite signaling 
among the local experiments described by a given process and the causal conhguration of these experiments. 

Proposition II.l. In an n-partite hxed-order process AV'■ n > 1, compatible with a deterministic causal conhguration 

a:*) 1, • • • , n), there can be signaling from (1 or ■ ■ ■ or A:) to (A: -H 1 or ■ ■ ■ or n), only if at least one of {1, • • • , A:) is in the absolute 

past of at least one of {A: H- 1, ■ ■ ■ ,n] according to /c»(l, • • • , n). 

It turns out that we can formulate necessary and sufhcient conditions for a process to be hxed-order causal, which are 
expressed entirely in terms of the condition stated in Proposition II.l applied to different subsets of the experiments. To 
formulate the conditions precisely, we will need to introduce the concept of reduced process. 

Definition II.8. (Reduced process): Consider an n-partite process n > 0, for a set of local experiments S = {1, • • ■ n). 

Fet = (1, • • ■ ,k} and S - {A: -H 1, • • • ,n), 0 < k < n, be two complementary subsets of the experiments (specihed up to 
relabeling), such that there is no signaling from S to This means that 


p{o\--- ,/) = p(o\--- ,o%\--- ,s^), (7) 

Vi-' e 5 o^ 6 0\ j — 1 , • ■ ■ n, 

i.e., we have well dehned conditional probabilities p(o', ■ ■ ■ , o^|s*, ■ • • , s*^) for the experiments in The collection of these 
probabilities will be called reduced process for and will be denoted by s AV'’ ■ 

Note that if a multipartite process is a valid pre-selected process, any of its reduced processes is also a valid pre-selected 
process because it is dehned conditionally on the same pre-selected event. Note also that a general multipartite process need not 
admit any reduced processes apart from the trivial process and itself, since it may involve signaling from every proper subset of 
the local experiments to its complementary subset. 

Before we state the conditions for a process to be hxed-order causal, we introduce another concept that will be needed later. 


9 


Definition II.9. (Conditional process): Consider an n-partite process n > 0, for a set of local experiments S = 

{1, ■ ■ ■ ,n). Let ^ = {1, • • ■ ,k} and S - {k + 1, - ■ ■ ,n}, 0 < k < n,he two complementary subsets of the experiments (specified 
up to relabeling), such that there is no signaling from Sto^ (and hence we can define a reduced process 'W^ = For 

each fixed event (i*, o', • • • in ^ for which p(o*, • • • , o*^|i*, • • • consider the collection of conditional probabilities 

{p(o^^',--- ,o”|i^^',--- , i", i*,o', • • • ,i^,o*)). These can be thought of as an (n-A:)-partite process for S dependent on the event 
(i*,o',--- , o*) in The collection of these processes for all values of (i', o', •• • , o*^) for which p(o', • ,o^|i',--- , i^) 

0 will be called conditional process and will be denoted by ' The relation between the whole process and 

the reduced and conditional processes can be written in the compact form 


= o 


( 8 ) 


where the product o between and 'W^ denotes multiplication of the respective probabilities of these processes, when 

defined, for the same value of the event in 


p{o\--- ,o’'\s\--- ,s") = p(o 


k+\ 




, s", s',o', ■ ■ • , s*,o^) p(o*, ■ ■ ■ ,o^|s', ■ ■ ■ , s^). 


(9) 


for p(o', • • • , o^|s', ■ ■ ■ , s*^) 0, and 




, o |i 


, s") = 0, 


( 10 ) 


for p(o*, • • • , o^|i*, • • • ,s’‘)-Q. 


Proposition II.2. A process for a set of local experiments S - {1, • ■ • ,n), n > 1, is compatible with a determinis¬ 

tic causal configuration a-»(1, • ■ • ,n) of these experiments (and is thereby fixed-order causal) if and only if, for the assumed 
causal configuration. Proposition II. 1 holds for the full process and all of its reduced processes for all bipartitions of the local 
experiments into two complementary subsets. The Proof SI is given in the Appendix. 

We next turn to general causal processes, beginning with the bipartite case. 


D. Bipartite causal processes 

Consider a process describing the local experiments of two parties, Alice and Bob. If the process is causal, there exist 

probabilities p(A<B\s^,s^), p(B<A\s^,s^), p(Ai'^B\s^,s^), with p(A<B\s^,s^) + p(B<A\s^,s^) + piAi^Bls'^, s^) - 1. We can 
therefore write the joint probabilities of the process in the form 

p(o'^,o*|s'^,s^) = p(A<B\s'^, s^) p(o'^,o^\s‘^,s^,A<B) 

-I- p{B<A\s^,s‘^) p{o^,o‘^\s^,s'^,B<A) (11) 

H- p{Aims^,s^) p(o^,o‘^\s^,s‘^,AmB), 

where each of the probability distributions p(o‘^,o^\s'^,s^,A<B), p(o‘^,o^\s^,s^,B<A), and p{o^,o^\s'^, s^,A:i^B), is dehned 
assuming that p(A<B\s'^, s^) + 0, p{B<A\s^, s^) + 0, and p(A^t.B\s^, s^) + 0, respectively, otherwise that term is absent from the 
expansion. The definition of causality (2) implies that p{A<B\s^, s^) - p(A<B\s^), p{B<A\s^, s^) - p{B<A\s^), p{A^t.B\s^, s^) - 
piA^^B). Since the sum of these probabilities must be unity, we obtain p{A<B\s'^) - p{A<B), p{B<A\s^) - p{B<A), i.e., the 
causal configuration of the local experiments is independent of the parties’ settings. Thus, the probabilities of a bipartite causal 
process AVc'^ have the form 


p{o^,o^\s^, s^) - p{A<B) p{o^,o^\s^,s^,A<B) 

+ p{B<A) p{o^,o^\s^, s^,B<A) (12) 

+ p(Aimp{o^,o%^,s\AuB), 

where the probability distributions p{o^,o^\s^,s^,A<B) = p{A<B,o^,o^\s^,s^)lp{A<B), p{o^,o^\s^, s^,B<A) = 

p(B<A,o^,o^\s^,s^)lp{B<A), and p{o^,o^\s^ , s^,AitB) = p{A:(,-i.B,o^,o^\s^,s^)lp{A:t,tB), whenever defined, describe pro¬ 
cesses, which we will denote by and respectively. (Note that we can imagine that the causal configura¬ 

tion k(A, B) taking values A<B, B<A, or A:^tB, is associated with an event in the past of both A and B, i.e., the processes AV^^^, 


10 


and can be thought of as proper pre-selected processes.) The assumption of causality imposes conditions on 

these processes too. Specifically, it can be seen that each of them must obey a no-signaling constraint compatible with the 
concrete causal configuration it is conditioned on; the first one must involve no signaling from Bob to Alice, p{o^\s^, s^,A<B) - 
, A<By, the second one must involve no signaling from Alice to Bob, p{o^\s^, s^,B<A) - p{o^\s^, B<Ay, and the third 
one must involve no signaling in either direction, p{o'^\s'^, s^,A^'^B) = p{o'^\s'^,A^'^B), p{o^\s'^, ,A^^B) - 

i.e., these are fixed-order causal processes. In a compact form, we can write 

^ p{^A<B) + p{B<A) -i- p(AitB) (13) 

i.e., a bipartite causal process has the form of a probabilistic mixture of processes that are compatible with the different mutually 
exclusive causal configurations of the parties (and correspondingly involve only one-way signaling in the respective direction, 
or no signaling). This form is not only necessary but also sufficient for a process to be causal because it explicitly gives a joint 
probability distribution p{k{A, B),o^, s^) - p{k{A,B))p{o^,o^\s^,s^,k{A,B)) that obeys the condition for causality (2) 
when each conditional distribution pio'^, o^\s^, s^, k(A, B)) obeys the no-signaling constraints compatible with k{A, B). Indeed, 
we have 


p(A^B,o^|/,i^) = p(B<A,o*|/,i*) -H p{AitB,o’^\s^,s’^) 

- p{B<A) p{o^\s^,s^,B<A) + p(A:^iiB) p{o^\s^,s^,AitB) (14) 

- p(B<A) p{o^\s^,B<A) -1- p(AitB) p{o^\s^,AitB) - p(A^B,o*|s*), 

and similarly p{BiA,o^\s^,s^) — p(B:^A,o^\s^). 

Since the non-signaling probabilities p{o^,o^\s^, ,A:^tB) are compatible with the one-way signaling constraints for the 
cases A<B or B<A, we can also write the probabilities (12) in the non-unique form 

p(o^o^|/,/) = p{w^^’^)p{o\o%\s\w^^’^) + (15) 

where and are two mutually exclusive variables for which the experiments of Alice and Bob respect the relations 
AiB and BM, respectively, with the probabilities of these variables satisfying p{w^^^) + p(w^^^) — 1. In a compact form, this 
can be written 


= q AV^iB ^ Q<q<\, 

where AV^^^ is a process that involves no signaling from Y to X, i.e.. 


(16) 


^r^/YIX ^ r^/X ^ ^ j 

The constraint (16) (equivalently, (15)) provides a means of testing whether a given bipartite process theory is compatible with 
causal order. For every fixed number of settings and fixed number of outcomes for each party, the joint probabilities satisfying 
Eq. (15) form a convex polytope, which is the convex hull of the polytope of probabilities that involve no signaling from Alice 
to Bob, and the polytope of probabilities that involve no signaling from Bob to Alice [34]. The non-trivial facets of this ‘causal 
polytope’ define bipartite causal inequalities, similar to the one in Ref. [4], whose violation by a given process theory indicates 
that the theory is not compatible with causal order. Note that a causal inequality does not need to be a facet of the causal polytope 
- it may correspond to an external plane. For instance, the causal inequality of Ref. [4], which concerns the case where one 
party has a binary input and a binary output while the other one has a quaternary input and a binary output, is not a facet of 
the respective causal poly tope [21]. One way of seeing this is to note that the derivation of the inequality in Ref. [4] only 
used certain consequences of the requirement that the causal configuration of the parties must be independent of the parties’ 
settings, but not the full requirement. The bipartite causal polytope for binary inputs and binary outputs has been characterized 
by Branciard [34] (see Ref. [21]). 


E. The tripartite and u-partite causal processes 

In the case of more than two parties, causal processes need not have the simple form of probabilistic mixtures of fixed- 
order causal processes with probability weights that are independent of the parties’ settings. This is because, consistently with 
causality, we have the possibility that the causal configuration of a subset of the local experiments may depend on the settings 
of other local experiments in their past. For example, imagine that we have a tripartite experiment where the input and output 
systems of each party correspond to the internal (e.g., spin) degrees of freedom of a particle that enters the respective laboratory 
at a given instant and leaves it at a given later instant. The time at which each party receives her/his particle is determined by 
some predefined mechanism, which also governs any exchange of information taking place outside of the parties’ laboratories. 
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(Note that in order for the internal degrees of freedom of the particle to constitute the only means of information exchange 
between each local experiment and the rest of the experiment, the experiment should be so designed that no communication via 
the times of input or output of the parties is possible. For example, each party may be restricted not to possess any common time 
reference frame with the rest of the experiment and to perform her/his operation during a fixed time interval with a stopwatch.) 
In such a case, if Charlie receives a particle first, the operation that he applies on the system could affect the order in which 
Alice and Bob receive their particles afterwards, since we can conceive of a mechanism that selects different future scenarios for 
that order conditionally on the outcome of a measurement performed on the internal degrees of freedom of the particle coming 
out of Charlie’s laboratory. This can result in the different scenarios depicted in Fig. 3. By construction, the outlined setup is 
compatible with the condition that the setting of each local experiment can be chosen independently of events in the causal past 
and causal elsewhere of that experiment, as well as of the causal configuration of such events and the experiment in question, so 
it would be associated with a valid causal process. 


B A B A 



FIG. 3: In a causal setup where Charlie performs his experiment in the causal past of both Alice and Bob, the causal 
configuration of Alice and Bob may depend on the setting of Charlie. 

Clearly, the dependence of the causal configuration of the parties on the parties’ settings cannot be arbitrary, because it must 
agree with causality. To formulate the constraints on this dependence, we will need to introduce some more terminology. 

For any fixed causal configuration /f( 1, ■ ■ • , n) of the local experiments S - {I,-- - ,n), there are local experiments that are 
in no-one else’s causal future. The full set of such local experiments, c {1, ■ ■ ■ , n), will be referred to as the local 

experiments that are first, or as the^rsf consecutive set and will be denoted by [i,/, ■■■]'. Next, if the first consecutive set 
does not include all of the local experiments, there are local experiments whose causal past contains local experiments from 
[/, j, - ■ ■ and only from [i, j, - ■ ■ ]^. The full set of these will be referred to as the local experiments that are second, or as the 
second consecutive set, and will be denoted by [k, I, - ■ ■ Then, if the first and second consecutive sets do not include all local 
experiments, there are local experiments whose causal past contains local experiments from both sets [/, j, - ■ ■ and [k, I, - ■ ■ 
and only from those sets. The full set of these will be referred to as the local experiments that are third, or as the third consecutive 
set, and will be denoted by {p, q, - ■ ■ and so on. 

The following proposition will play a central role in our derivation of the form of multipartite causal processes. 

Proposition II.3. Consider a causal process for S — - ■ ,n], n > 1, with an associated joint probability distribution 

P{k{1, • • • ,n),o*, • • • • • • , i"), where a :( 1 , ■ ■ • ,n) are the causal configurations of the local experiments. The probabil¬ 

ity for the first K consecutive sets to consist of specific local experiments, [li, ■ ■ ■ , ni]\ ■ ■ ■, [Ik, ■ ■ ■ , hk]^, these experiments 
to have a specific causal configuration k(1i, ■ ■ ■ ,mk), the experiments in the first K - I consecutive sets to have a specific set 
of outcomes o'", ■ • ■ ,o"’^’, and a given (possibly empty) subset (Ik, • ■ ■ ,^k) c (Ik, • ■ • , hk) of the local experiments in the K'* 
set (given up to relabeling) to have specific outcomes can depend non-trivially only on the settings of the local 

experiments indicated in the first K - I consecutive sets and the subset (Ik, ■ • • 


pWi,--- ,«k),[1i,'" ,[1k,--- ,n^]^,o^\--- a") 

= /?(k(1i,--- ,nK),[li,--- ,[1 k,-" ,5^-^), (18) 

where we define the 0'^' set as the empty set. The Proof S2 is given in the Appendix. 

An important consequence of Proposition II.3 is that the probability for a given set of local experiments to be first is indepen¬ 
dent of the settings of all parties (this is the case of K = 1 and the subset { Ik, • • • , .Jk) being empty). For example, consider the 
different causal configurations of three parties - Alice (A), Bob (B), and Charlie (C) - which are compatible with [C]* (Fig. 3). 
Each of the individual configurations has a probability that may depend on the setting of Charlie, but the overall probability for 
Charlie to be first, i.e., for any one of these configurations to be realized (which is the sum of the probabilities for the individual 
configurations), is independent of the settings of all parties, including Charlie. This independence of the first consecutive set on 
the settings of all parties will play a key role in our characterization of the structure of multipartite causal processes. We will first 
develop the characterization for the case of three parties in order to illustrate the underlying principle, and then we will extend it 
to the general multipartite case. 
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Groups of tripartite causal configurations whose probabilities are independent of the parties’ settings, 

defined by the set of parties that are first 

_[A]*: [A<B, A<C, B<C] or [A<B, A<C, C<B] or [A<B, A<C, Bj^C] _ 

_[B]': [B<A, B<C, A<C] or [B<A, B<C, C<A] or [B<A, B<C, A^^C]_ 

[C]': [C<A, C<B, A<B] or [C<A, C<B, B<A] or [C<A, C<B, AMB] 

[A, Bf: [A^jlS, A<C, or [A^jlS, B<C] or [A^jlB, A<C, B<C] 

[A, C]': [A^jlC, A<B, or [A^jlC, C<S] or [A;^ 5 lC, A<6, C<S] 

[B, C]*: B<A, CjfjlA] or B:^^A, C<A] or B<A, C<A] 

[A,B,C]': [AitB,BitC,AitC] 


TABLE I: The mutually exclusive groups of tripartite causal configurations 


The groups of tripartite causal configurations compatible with the different possibilities for the first consecutive set of parties 
are listed in Table I. In terms of these possibilities, the probabilities of a tripartite causal process can be written 

p{,o^,o^,o^\s^,s^,s^) = p([Af) p(o'^,o*,o'^|/,/,i‘^,[A]‘) 

+ b([B]’) p{o^,o‘^, [B]’) 

+ p([Cf) p(o^, o®, [C]^) 

+ p{[A,Bf) p(o^,o‘^,o^\s^,s‘^,s'=,[A,Bf) (19) 

+ p{[A, C]') p{o^,o‘^, 0*^1/, /, [A, C]‘) 

+ p([B, C]’) p(o^,o^, s®, [B, Cf) 

+ p([A, B, Cf) pio^, o^, s^, [A, B, C]’), 

where 

pilAf) + pilBf) + pilCf) 

+ p([A,Bf) + p([A,Cf) + p([B,Cf) + p([A,B,CY) = 1, (20) 

and the probability distributions p(o'^, [■ ■ • ]’) for a given [■ • ■ defined whenever p{{- ■ • ]^) + 0, describe processes 

which we will denote by . (Note that we can imagine that the variable [■■■]' is associated with an event in the past of all 

local experiments, i.e., these can be thought of as a proper pre-selected process.) 

In a compact form, Eq. (19) can be written 

= p([A]') AVAI' + p([B]*) H- p{{Cf) 

-I- p([A,B]‘) -I- p([A,C]^)AVA^^^' + p([B,C]‘) (21) 

-I- p([A,B,C]‘) 

i.e., the overall process is a mixture of processes defined conditionally on the different scenarios [■ ■ ■ ]'. The processes AV^ '^' 
cannot be arbitrary but must be compatible with causality, the conditions for which we derive next. 

Consider the case in which one party is first, say [C]^ (Eig. 3). There are three distinct causal configurations compatible with 
this case, in which A<B, B<A, or A^^B (Table I). We can expand p(o'^, o^, o^\s‘^, s^, s‘^, [C]^) conditionally on these configura¬ 
tions as follows; 

p(o^,o^,o‘~\s‘^, s^, s^, [C]') = p(o‘^\s^, s^, s^, [C]^)x 

[/7(A<B|/, s^, s'=, o^, [C]') p{o^,o’^\s^, i®, o^,A<B, [C]^) -H 

p{B<A\s^, jB, 

P(A^5^B|/, s^, s^, [C]^) p{o^, o*|/, o^,AmB, [C]*)], 

where picA', o^\s'^, s^, s^, , k{A, B), [C]*) is defined when p{k{A, B)|i'^, s^, s^, , [C]') 0, and 

p(A<B|/, /, s^,o^, [C]') -H p(B<A|/, s^,o^, [C]‘) 

+p{AMB\s^,s'^,s^,o^,{Cf) = 1 . 


( 23 ) 
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From Proposition II.3, we have that 

[C]') = p{[C]\o^\s^, s\ s^)lp{[C]^) = p{[C]\o^\s^)lpm^) - p{o^\s^, [C]'). 

Similarly, we have 

p{A<B\s^, o^, [C]') = p{.A<B\s^, s^, o^, [C]'), 

p{B<A\s^, s®, o^, [C]’) = p{B<A\s’^, , [C]’), (24) 

p{AitB\s^, o^, [C]') = p{AitB\s^, o^, [C]'), 

which together with Eq. (23) implies 

p(A<B\s^, s^, s^, o^, [C]‘) = p(A<B\s^, , [C]‘), 

p(B<A|/, /, o^, [C]‘) = p{B<A\s^, o^, [C]‘), (25) 

p(Aims^,s^,s‘^,o‘^,[Cf) = piAitBls'^ ,0^^ ,[Cf)- 

Substituting this in Eq. (22), we obtain 

p(o^,o^, o^ls'^, s^, s*^, [C]’) = p(o^\s'=, [C]')x 

[p(A<B\s^,o^,[CY) p(o^,o‘^\s^,s^,s^,o^,A<B,[Cf) + p(B<A\s^, 0 ^^,[Cf) p(o'^,o‘^\s^,s‘^,s^,o‘^,B<A,[Cf) 

+ piAMB\s'^,o'^,[Cf) p(o^,o^\s‘^,s^,s'^,o'^,AmB,[C]% (26) 

with 


p(A<B\s‘=,o‘=, [C]*) + p(B<A\s^,o^, [C]‘) + piAMB\s‘=, o‘^, [C]’) = 1, (27) 

where the probability distributions p{o^,o^\s^,s^,s^, 0 ^,A<B,[C''^), p{o^,o^\s^, , 0 ^,B<A,[C''^), and 

p{o^, 0 '^,A:^^B,[C'^) describe bipartite processes for Alice and Bob for every fixed value of {s^,o'^). The 
assumption of causality implies conditions for these processes too. They must respect the no-signaling constraints imposed by 
the causal configuration k{A, B) they are conditioned on - the first one must involve no signaling from Bob to Alice, the second 
one must involve no signaling from Alice to Bob, and the third one must involve no signaling between Alice and Bob in either 
direction. This follows from the fact that 


p{o^, o®|/, s’^, s^, o^, k{A, B), [C]*) 


p([C]^/f(A,B),o'^,o^,o‘'|s^, s^, s^) 
p([C]i) p{oC\sC, [C]i) p{K{A,B)\sC,oC, [C]!)’ 


(28) 


and the observation that since only the numerator on the right-hand side depends on s^, o'^, s^, and o^, the respective no-signaling 
constraints on the quantity on the left-hand side follow from the requirement that the numerator is compatible with Eq. (2). 

Notice that the probabilities p(o^\s'^, [C]^ in Eq. (26) define a reduced monopartite process for Charlie, AT"', while the 
probabilities enclosed by the square brackets define a conditional bipartite process AV'c'*'*', which is causal (indicated by the 
subscript c) for every fixed (s^,o^). In a compact form, this can be written 


^ ^a.bic „ (29) 

The form (29) is necessary for a causal process for which all causal configurations that have non-zero probabilities respect [C]* 
(in that case, a causal process of the general form (21) reduces to the term ). It is also sufficient, because this form provides 
an explicit joint probability distribution p^^^'(k(A,B,C),o^,o^,o^\s^,s^,s^) - equal to p([Cf,K(A,B),(/',o^,o^\s^,s^,s^) - 
p(o‘'|i‘', [C]*) p(a'(A,B)| i‘',o*', [C]^) p{o'^,o^\s‘^,s^,s‘^,o‘^,j<(A,B),[C]^) when k(A,B,C) is compatible with [C]*, and to zero 
otherwise - for which condition (2) is satisfied with respect to every party. Indeed, condition (2) is satisfied with respect 
to C since the probability for any party being in the causal past or causal elsewhere of C is zero. It is also satisfied with 
respect to A (similarly for B) since the no-signaling constraints respected by p(o'^, s^, s^, , k{A, B), [C]^) guarantee that 

p[‘'''(A-(A,B, C), A;^B,A^C,o^,o^|i'^, s^, s^) - p'^^"^\k(A, B, C),AiB,AiC,o^,o‘'\s^, s^). The necessary and sufficient conditions 
for a causal process compatible with [A]^ and [B]^ are analogous. 

Let us now consider the case where two parties are first, say [B, C]^. The possible causal configurations in this case (Table I) 
are depicted in Fig. 4. Similarly to the previous case, using the assumption of causality, we can expand the probabilities 
pio'^, ...Is*^,..., [B, C]') conditionally on the different configurations as follows: 

p(o^, o®, o^\s^, i®, [B, C]‘) = p(o*, [B, C]‘)x 

[p(B<A, CmA\s'^, o®, o^, [B, C]^)p{o^\s^, s^, o^, , 0 =, B<A, C^A, [B, C]’) 

H- piBitA, C<A|/, o^, s^, o^, [B, Cf)p{o^\s^, /, o®, , BitA, C<A, [B, C]‘) 

H- p{B<A, C<A|s®, o^, o^, [B, Cf)p{o^\s^, s^, o^, s^, , B<A, C<A, [B, C]^)], 


(30) 
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with 


p{B<A, CitA\s’^, o’^, o^, [B, C]‘) + (BitA, C<A\s’^, o^, , [B, C]^)+ 

p{B<A,C<A\s‘^,o‘^,s'=,o'^,[B,Cf) = 1, 


( 31 ) 


where the probabilities p{o^, o^\s^, s'^, [B, C]') in Eq. (30) define a reduced bipartite process that involves no signaling between 
B and C, and the probabilities in the square brackets describe a conditional process for A. The fact that there is no signaling 
between B and C in the first process follows easily from Proposition II.3. 

It turns out that the decomposition over different causal configurations does not yield any nontrivial conditions on the proba¬ 
bilities of the conditional process enclosed in the square brackets, i.e., the simpler form 


pio^, o^, 0*^1/, s^, s^, [B, C]’) = p(o^, o^\s^, s^, [B, C]^) p{o%^, s^, o^, , [B, C]’) 


(32) 


is both necessary and sufficient for a valid Necessity is obvious since Eq. (30) implies Eq. (32). Sufficiency follows 

from the fact that the right-hand side of Eq. (32) is compatible with the particular case p(B<A, C<A\s^, o^, s'^, o^, [B, C]*) = 1, 
where the only non-trivial constraints on the probabilities p(o'^, o^, , s^, [B, C]^) imposed by k{A, B, C) are that there is 

no signaling from Alice to Bob and Charlie, and no signaling between Bob and Charlie in their reduced bipartite process. These 
are clearly guaranteed by Eq. (32) when the reduced process {p(o^, o^\s^, s'", [B, C]^)) involves no signaling between Bob and 
Charlie. Therefore, similarly to Eq. (29), we can write Eq. (32) in the compact form 

^ r^A\B,C „ ( 33 ) 

where ’Wnf is a non-signaling bipartite process for Bob and Charlie, and is a monopartite process for Alice conditional 

on the events in the laboratories of Bob and Charlie. 



EIG. 4; The three possible tripartite causal configurations included in the group where B and C are first. Erom left to right: 
[B^'^C and B<A and [B:aC and B<A and C<A], and and C<A]. 


Finally, in the case where all of the parties are first, we only have the constraint that 

is a tripartite non-signaling process. Again, this follows from Proposition II.3. 

Therefore, we have obtained that a tripartite causal process must have the form 

<\yA,B,C ^ p([^]I) <w^^c\A o AV® 

h-P([C]‘)AV'^’®I‘^oAV‘^ -I- p{[A,Bf)AV‘^'^’'^ oAV^f 

-Hp([A,C]‘) -I- pi[B,Cf)'W^'’^-‘^ oAV^f 

+p([A,B,Cf)'W^f’^, 


(34) 


(35) 


with suitable probability weights p([A]'), p([B]’), p([C]*), p([A,B]^), p([A,C]^), p([B, C]'), and p([A,B, C]'). This 
form is also sufficient for a tripartite process to be causal because it explicitly gives a probability distribution 
P{k{A,B,C),o'^,o^,o^\s'^,s^,s^) - p(['■ • ]') p(a-(A,B, C),o'^,o^,o‘'|i^, i'', [■••]*) that satisfies Eq. (2). Indeed, we 


have seen that each of the distributions p{k(A, B, C), o^, o", o' 


® I pA pC 


■s*'> [■ • ■ ]^) in this convex mixture is an extension of a causal 


process {p{o^, o^, o''|i'^, , s^, [■ • ■ ]*)), and hence it satisfies Eq. (2). Since the weights p{{- ■ ■ ]’) in the mixture are independent 

of s'^, s^, and s'', and Eq. (2) is linear in p{k{A, B, C), o'^, o^, o''|s'^, s^, s'' , [■ ■ ■ ]*), the equation is satisfied by the whole mixture 
too. 

Condition (35) can be further simplified by noticing that the processes corresponding to the cases in which two or three parties 
are first have forms compatible with cases in which only a single party is first. For instance, satisfies the necessary and 

sufficient conditions for a valid AV^^^ or a valid , while satisfies the necessary and sufficient conditions for any 

of AV^"^!', AV^^^\ or AT'f''^’. The compatibility of with [C]*, for example, can be seen from the fact that Eq. (32) (or 


15 


Eq. (33)) is compatible with the case [C]' in which C<B<A, since the only constraints in that case are that Alice cannot signal to 
Bob and Charlie, and that Bob cannot signal to Charlie, which are satisfied by the probabilities in Eq. (32). Similarly, 
is compatible with [C]^. A process is compatible with any causal configuration since it does not involve signaling 

between any of the parties. These observations suggest that we can group (in a generally non-unique way) the terms in the 
probabilistic mixture ( 21 ) so as to obtain a mixture of three processes 

+ ( 36 ) 

where y^(A,B)ic^ ^j.g some mutually exclusive variables whose probabilities satisfy + 

P(vj;(- 4 ,c) 5 <B) p(yyiAB)iC'^ _ gygjj that conditionally on these variables, the causal configuration of the parties belongs to 
one of the groups compatible with {B, C):^A (meaning A CM), (A, C)iB, and (A, B)^C, respectively, while the processes 
r\^{B,c)iA^ <yp(A,C)iB^ r\^(A.B)ic^ Satisfy the most general causal constraints compatible with these groups. Eor instance, con¬ 
ditionally on the causal configurations of the parties may belong to any of the groups defined by [A]\ [A, B]*, [A, C]^ 

and [A, B, C]^. The process ^^B,c)iA ^ould itself be a probabilistic mixture of processes compatible with these groups, which 
most generally satisfy the constraints satisfied by ^1-^1 . That is, 

<yp(B.C)M - '\>^^B,C\A ^ <yipA^ ( 3 , 7 ) 

<yp{A,C)iB _ <\fpA,C\B ^ ( 3 g) 

<yp(A.B)iC _ <\lipA,B\C o <\ipC ( 3 ^) 

Obviously, the existence of a convex decomposition (36) is both necessary and sufficient for a tripartite process to be causal, 
since any process of the form (35) can be written in the form (36), while Eq. (36) is a special case of Eq. (35). 

As in the bipartite case, for any fixed number of settings and fixed number of outcomes for each party, the constraint (36) pro¬ 
vides a means of testing whether the corresponding tripartite probabilities are compatible with causality. The set of probabilities 
that satisfy Eq. (36) is the convex hull of the probabilities compatible with causal configurations in which (B, C)M, (A, C)^B, 
and (A, B)iC. One can see that the latter form polytopes, since the constraints imposed by causality in each of these cases are 
linear. Eor example, in the case of (A,B)^C, we have the constraint that p{o^,o^s^, s '^involve no signaling 
from Alice and Bob to Charlie, and that for every (s^, o^), the resultant process between Alice and Bob is causal. The first 
requirement corresponds to a set of linear constraints. The second requirement corresponds to the condition that for every fixed 
the probabilities p(o'^,o^\s^,s^,s^,o^,w^^’^^^^) = are the proba¬ 

bilities describing a causal process for Alice and Bob, which themselves belong to a polytope and hence respect a set of linear 
inequalities. Plugging these probabilities in the respective inequalities and multiplying both sides by p(o^\s ^would 
yield a set of linear inequalities for pio'^, o®, s^, s*', Therefore, these probabilities also form a polytope, and so 

do the probabilities of the form (36). The nontrivial facets of the polytope of probabilities (36) would define tripartite causal 
inequalities, whose violation indicates incompatibility with causal order. Examples of tripartite causal inequalities for binary in¬ 
puts and outputs can be found in Refs. [ 7, 9] (we have not investigated whether these are facets of the respective causal polytope). 

The extension of the conditions for causality of a process to the case of n parties can be defined iteratively. The following 
theorem provides the generalization of condition (35): 


Theorem II.l. A process for a set of parties ,S = {1, • ■ ■ ,«),«> 1, is causal if and only if it can be written in the form 

^ oAVi, (40) 

Ac.S.A#!) 

where the sum is over all nonempty subsets X of the local experiments S, px are suitable probability weights (which can 
be interpreted as the probability for X to be first, px - b([A]')), S\X denotes the relative complement of X in S, is a 
non-signaling reduced process for X, and the conditional process AVc is either the trivial process (when A = ,S) or other¬ 
wise can be written in the same form (40) for every given value of the possible events in X. The Proof S3 is given in the Appendix. 

As in the bipartite and tripartite cases, we can simplify the conditions for an n-partite process to be causal by noticing that the 
constraints on a process compatible with a given set of A: (1 <k<n) parties being first are compatible with the constraints on a 
process compatible with the case in which only a single one of the k parties is first. Therefore, by an argument analogous to the 
one in the tripartite case, we obtain the following alternative formulation of the conditions. 
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Theorem II.2. (Canonical causal decomposition): A causal process for n parties is one that can be written in the (generally 
non-unique) form 


- q. > 0,V/, 

!=1 1=1 

with 

_ ,^1,-,«|i ^ 

where the (n - l)-partite conditional process ^^• ■’'“'>'+'■ ■•" 1 ' either trivial (when n = 
of the event in i. 

The weights qi in Eq. (42) can be thought of as the probabilities qi = for a mutually exclusive set of vari¬ 
ables for which the causal configurations of the parties belong to a group such that ( 1 , •• • ,i- 1 ,!H- I,--- 

Theorem II.2 (alternatively Theorem II. I ) gives iteratively formulated necessary and sufficient conditions for a process to be 
causal in the general multipartite case. It can be understood as describing an ‘unraveling’ of the different possible sequences of 
operations in steps: first, the party that is first and his/her monopartite process are selected at random based on some probability 
distribution; next, the party that is second and his/her monopartite process are selected at random from some probability distribu¬ 
tion that most generally can depend on the first party’s setting and outcome; next, the party that is third and his/her monopartite 
process are selected from some probability distribution that most generally can depend on the settings and outcomes of the first 
two parties, and so on. We refer to this intuitive decomposition as the canonical causal decomposition of a causal process. 

By an argument analogous to the one in the tripartite case, one easily sees from Theorem II.2 that for any fixed number of 
settings and outcomes for each party, the causal probabilities for n parties form a polytope, provided that the causal probabilities 
for (n-1) parties form a polytope. By induction, this implies a polytope structure for the general multipartite case. The nontrivial 
facets of such a polytope define causal inequalites. Examples of n-partite causal inequalities, where n - 2k + I, for binary inputs 
and outputs can been found in Refs. [7, 9]. It would be interesting to check if these inequalities are facets of the respective causal 
poly tope. 


= 1, (41) 

(42) 

1 ) or has the form (41) for every value 


III. THE QUANTUM PROCESS FRAMEWORK 
A. General quantum processes 

The quantum process framework introduced in Ref. [4] is a particular theory within the general operational framework for 
pre-selected processes discussed in the previous section. It is based on a set of assumptions about the local operations of the 
parties and the joint probabilities for their outcomes, which we review next. 

The first main assumption is that of local quantum mechanics [4], which says that each local experiment is described as in 
standard quantum mechanics. Specifically, let Xi and X 2 denote the input and output systems of a local experiment X. It is 
assumed that these systems are associated with Hilbert spaces and of dimensions - dxi and dimt/T^^ = dx2, 

respectively. The set of operations that can be performed between the input and output systems is the set of standard quantum 
operations (or quantum instruments [35]). A quantum operation has a set of outcomes labeled by j - Each outcome 

induces a specific transformation from the input to the output, which corresponds to a completely positive (CP) map AiJ : 

where JLi'H) is the space of linear operators over the (finite-dimensional) Hilbert space “K. The action of 
each AdJ on an operator cr e X('A(^‘) can be written in the Kraus form [36] AlJ(cr) = EjkcrE^jj^, m = dxidx2, where the 
Kraus operators Ejj^ : satisfy 2™=i E^ji^Ejh < 1^‘, V j. The set of CP maps (A1J| corresponding to all possible 

outcomes of a quantum operation has the property that YI'j=\ is CP and trace-preserving (CPTP), which is equivalent to the 
condition 2"=i 2^=1 E]^Ejk = 1^'. 

The second main assumption is that the joint probabilities for the outcomes of the operations of a set of parties, Alice, Bob, 
Charlie, ■ ■ •, is a non-contextual function of the local CP maps, 

pH, j, k, • • ■ KMf],(Mf],[M,^] ■ ■ ■) = coiMf, M^, Mi, ■■■). (43) 

The requirement that local procedures agree with standard quantum mechanics implies that the function a> should be linear in 
the local CP maps [4]. 
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Such a linear function can be written in a convenient form by expressing each local CP map as a positive semidefinite 
operator using a version of the Choi-Jamiolkowsky (CJ) isomorphism [29, 30]. In this isomorphism, the CJ operator Mf'^^ e 
® corresponding to a linear map Mf : is defined as := \I ® Af,-where 

is a (not normalized) maximally entangled state on two copies of , the set of states {|y))y^[ 
is an orthonormal basis of I is the identity map, and T denotes matrix transposition in the basis of and some 

specific basis of The CJ operator defined in this way does not depend on the choice of basis of but does depend 
on the choice of basis of [31]. For the purposes of the present paper, the latter basis can be an arbitrary fixed basis. We 
note, however, that within the time-symmetric generalization of the framework developed in Ref. [16], this basis has a nontrivial 
physical significance related to the transformation of time reversal. Specifically, in that formulation, the Hilbert space 'TY'^^ on 
which the CJ operator is defined is not interpreted as the original output Hilbert space of the CP map, but a time-reversed copy 
of it. In this paper, we will not be concerned with that formulation, but will simply regard the CJ representation of CP maps, 
defined for an arbitrary choice of basis, as a mathematical convenience. Using the CJ representation, the joint probabilities (43) 
can be written in the form 

/7(i,Y,k,---|(Mf],(M^],(M^],---) 

= Tr - ^ ^b,B 2 g, ^c.Cj g,... ^ ( 44 ) 

where jyA,A 2 B,B 2 C,C 2 - g ^(.^a, ^,^A 2 ® ® <^C 2 ^ .) 

The last main assumption behind the quantum process framework is that the local operations of the parties can be extended 
to act on input ancillas Aj, Bj, C[, ■ ■ •, that are allowed to be prepared in an arbitrary quantum state ' > 0, 

' = 1. Upon such an extension, the original operator ^AiA 2 BiB 2 CiC 2 - becomes vT'^iA 2 BiB 2 CiC 2 - ^ pA\B\c[ - 1 - 4 ) ^he 
requirement that the probabilities are non-negative for any combination of local CP maps A1^, AY®, AY^, ■ ■ ■, on the extended 
systems A = AiA'jA 2 , B - BiB'^B 2 , C — CiC[C 2 , • ■ ■, implies that [4] 

^A,A 2 BiB 2 CiC 2 - > ( 45 ) 


In addition, since the probabilities should sum up to 1 for a complete set of local outcomes, we have the condition that 

Tr |^vT'^A 2 BiB 2 CiC 2 - ^ 4 ^a,A 2 g, |g, j^CtC 2 g,... ^ ( 46 ) 

Y4^a,A2^^BiB2^4^CiC2^ ... > 0, 


where Tr^j denotes partial trace over X 2 . Here, we have used the fact that a linear map AY^ is CPTP if and only if its CJ operator 
satisfies > 0 and An operator VT-^i'^ 2 ®i^ 2 CiC 2 - jbat satisfies conditions (45) and (46) is called a process 

matrix [4]. Knowing the process matrix, by Eq. (44) we have the probabilities for the outcomes of any combination of local 
operations of the parties, i.e., the process matrix provides a complete description of a process. (Here, the set 5^ of possible 
settings of a given patty is the set of quantum operations with the respective input and output systems.) 

The process matrix can be expanded in a Hilbert-Schmidt basis of orthogonal matrices on the Hilbert spaces of the input 
and output systems of the parties, which is helpful in analyzing different properties of the correlations that the process allows. 

A Hilbert-Schmidt basis of -C('7Y^) is given by a set of Hermitian operators with cTq - 1^, Trcr^cr^ = dxS^y, and 

Trcr^ = 0 for j - 1, - 1. In such a basis, a process matrix can be written 


y^AxAiBiBnCiCr-- 


^ Wijklmn -crf' ® ® a-f ® erf- ® cr^‘ ® ® ■ ■ ■ , 

Wijklmn-.- 6 R, Vi, j, . 


(47) 


It turns out that many properties of process matrices can be formulated entirely as statements about the nonzero terms in the 
above expansion [4]. For this purpose, it is convenient to introduce the following terminology. Non-zero terms proportional to 
erf' ® (i > 1 ) will be called terms of type Ai, non-zero terms proportional to erf" 81 erf' ® I''”"'' (i, j > 1 ) will be called terms 
of type A 2 B 1 , etc. Every process matrix also contains a non-zero term proportional to the identity operator on all systems. This 
term will be referred to as of type 1 , or as the identity term. 

In Ref. [4], it was shown that, in the bipartite case, an operator satisfies condition (46) if and only if it contains at 

most terms from the following types; 1, Ai, By, A 2 B 1 , A 1 B 2 , A 1 A 2 B 1 , AiBiB 2 - This rule also includes the monopartite case, 
which is obtained when the input and output systems of one of the parties is trivial (the one-dimensional Hilbert space C'). 
Specifically, a monopartite operator satisfies condition (46) if and only if it contains at most terms of type 1 and Ai. The 

types of allowed terms can be generalized to the «-partite case as follows. 
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TABLE II: The types of terms allowed in a tripartite process matrix 


Cl 

B2C1 

Bi 

B1C2 

BiCi 

BICIC2 

B1B2C1 

A2CI 

A2B2C1 

A2B1 

A2BIC2 

A2B1C1 

A2B1C1C2 

A2B162C1 

Ai 

AIC2 

AiCi 

A1C1C2 

AIB2 

A1B2C2 

A1B2C, 

A1B2C1C2 

AiBi 

A1B1C2 

AiBiCi 

A1B1C1C2 

A1B1B2 

AIBIB2C2 

A1B1B2C1 

A\B\B 2 C\C 2 

A1A2C1 

A1A2B2C1 

AIA2BI 

A1A2B1C2 

A1A2B1C1 

A\A 2 B[C\C 2 

A\A 2 B\B 2 C\ 

1 



TABLE III: The types of terms allowed in a causal process matrix VPf 

{A,B)^C 

compatible with (A, B)^C. 

Cl 

Bi 

BIC2 

BlCi 

B1C1C2 

A2B1 

A2B1C2 

A2BICIC2 

Ai 

A1C2 

A,Cl 

A1C1C2 

A1S2 

A1B2C2 

A1B2C1C2 

AiBi 

AifiiC2 

A161C1 

A1B1C1C2 

A1B1B2 

A1B1B2C2 

A{B\B 2 C{C 2 

A{A 2 B\ 

A1A2B1C2 

A\A 2 B{C\C 2 

A1B2C1 

A2B1C1 

A1A2B1C1 

A1B1B2C1 

1 


Proposition III.l. An operator of the form (47) satisfies condition (46) if and only if in addition to the identity term it contains 
at most terms in which there is a nontrivial cr operator on Xi and a trivial one (the identity operator) on X 2 for some party 
Xe{A,B, C,---}. 


In the Appendix, we present Proof S4 of the above proposition for the case of three parties and the general case follows 
accordingly. From the analysis in Proof S4 we see that a general operator VT'^i'^ 2 ^i^ 2 CiC 2 contain up to 64 types of 
terms. The condition for normalization of probabilities (46) narrows the types of terms to the 38 types listed in Table II. The 
positive semidefiniteness condition (45) does not limit any further the allowed types of terms, because one can conceive of a 
positive semidefinite matrix containing nonzero terms of any chosen type (this can be ensured by taking the nontrivial cr terms 
with non-zero coefficients of sufficiently small magnitude relative to the weight of the identity term which is always fixed). 
Thus, an operator VT'^i'^ 2 ^i^ 2 CiC 2 ^ valid tripartite process matrix, i.e., it satisfies conditions (45) and (46), if and only if it 

satisfies condition (45) and contains only terms of the types listed in Table II, where the identity term comes with the weight 
woooooo = ■ In a similar way, one proves the allowed types of terms in the general n-partite case. (For an alternative 

formulation of the conditions for an operator to be a valid process matrix, see Ref. [19].) 

The types of terms that appear in the expansion of a process matrix are closely related to the signaling between the parties 
that the process allows. For example, a bipartite process involves signaling from Bob to Alice if and only if the process matrix 
contains terms of type A\B 2 or A\B\B 2 [4]. To state the condition for (no) signaling in the multipartite case, it is convenient to 
introduce the following terminology (see also Ref [19]). Consider a Hilbert-Schmidt term cr .' ®cr ®(Ti^ ■ ■ 

B C C 

as in Eq. (47). The restriction of this term onto, say, subsystems B 2 C 1 C 2 • ■ ■ is defined as cr^ - ® cr^' ® cr„^ ® • ■ •. 


Proposition III.2. An n-partite process matrix for a set of parties (1, • ■ ■ , n] does not permit signaling from, say, (1 and 2 and 
■ ■ ■ and k) to (k H- 1 and k + 2 and ■ ■ ■ and n) if an only if it contains only terms whose restriction onto 1 112 ■ ■ ■ ^ 1^2 are of the 
allowed types for a process matrix on (1, ■ ■ ■ , k] as described in Proposition III. 1 . The Proof S5 is given in the Appendix. 


As an example, a tripartite quantum process that is causal and compatible with a situation in which Charlie is first (Fig. 3) 
should involve no signaling from Alice and Bob to Charlie, and hence it can only contain the types of terms listed in Table III. 
These constraints on the allowed types of terms imposed by causal order will turn out to play an important role in the character¬ 
ization of the so-called causally separable quantum processes, which we define in the next subsection. 
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B. Causally separable quantum processes 

Given that quantum processes have a simple description in terms of process matrices, it is natural to ask whether the property 
of causality can also be expressed in terms of simple conditions on these matrices. Consider a bipartite quantum process for 
Alice and Bob, and assume that it is a hxed-order process compatible with the causal configuration A<B. In that case, as argued 
earlier, the only constraint imposed by causal order is that the process should involve no signaling from Bob to Alice. As pointed 
out in the previous subsection, there can be signaling from Bob to Alice if and only if the process matrix contains 

terms of type A 1 B 2 or AiBiZ? 2 - Therefore, a process matrix is compatible with A<B if and only if none of these types of terms 
appear in its expansion. This means that such a process matrix has the form 

^ ( 43 ) 

where > q (with = (Iat) contains at most terms of type 1, Ai, B\, A\Bi, A 2 B 1 , A 1 A 2 B 1 . (This is equivalent to 

saying that is a valid process matrix for the case where Bob has a trivial output system, "K*' = C'.) 

Similarly, in the case where A^^B, the process matrix has the form 


TfyrAMB _ ypAiB, g, (49) 

where W^'B' > 0, TrVT'^‘^‘ = 1. Such a process is realized in a situation in which Alice and Bob receive input systems in a joint 
quantum state with a density matrix W^'B', and their output systems are discarded. 

We can unify these two conditions to write down the form of a process matrix compatible with BM, which is identical to (48), 

^^BM ^ y^2AlA2i^, g, |B2^ (50) 

where is a valid process matrix for the case where = C'. 

As shown in Ref [37] within a different framework, all process matrices of the type (50) can be realized by embedding the 
experiments of Alice and Bob in a quantum circuit, so that Bob’s experiment does not precede Alice’s experiment in the order 
of the circuit composition. Most generally, this corresponds to providing Alice with an input system that is entangled with an 
ancilla, then sending Alice’s output together with the ancilla through a quantum channel into Bob’s input, and then discarding 
Bob’s output. Such a process is referred to as quantum ‘channel with memory’. 

As we have seen earlier, a bipartite causal process is one that can be written in the form (16), where AV^^b and are 

two processes compatible with A^B and BiA, respectively. It is then tempting to conjecture that the class of causal quantum 
processes might be those whose process matrices can be written in the form 

^a,A 2 B,B 2 ^ ^ ^ (i-q) wB^^, 0<q<l, (51) 

where W^^b and wB^^ have the form defined in Eq. (50). Certainly, since the probabilities for the outcomes in the quantum 
process framework are linear functions of the process matrix, a process matrix of the form (51) describes a causal process. 

However, the condition for a process to be causal (Eq. (16)) does not imply that and in the convex decomposition 

of the process should themselves be quantum process; only their convex mixture needs to be. While it is conceivable that the 
structure of quantum processes might imply the form (51) (indeed, this has been shown to hold for a limited class of bipartite 
quantum processes [14]), there is no obvious reason to expect this to hold in the general case. In fact, we will see that the natural 
generalization of condition (51) to the multipartite case is not equivalent to the condition that a process is causal (the same holds 
also for other possible generalizations that we will discuss later). Very recently, the same was shown to hold also in the bipartite 
case, by Feix, Araujo, and Brukner [39]. 

A bipartite quantum process that admits the decomposition (51) was called causally separable [4]. One way to think of the re¬ 
lation between causal and causally separable quantum processes is in analogy with the relation between Bell-local and separable 
(non-entangled) quantum states. Given an arbitrary multipartite quantum state with a density matrix the probabilities for 
the outcomes of a set of local POVM measurements {Mf}i^oA, (Mj]jeofi, • ■ ■ ( XieO'' 'Zijeo'> ■ ■ ■) given 

by 


p{i, B ■ ■ ■ \{Mf }i^o^ , {mB}j^o‘ ,...) = Trlp^'^ -Mf ® ® ■ ■ ■ )■ (52) 

A Bell-local state is one for which the joint probabilities for the outcomes of any combination of local measurements admits 
a local hidden variable description (and hence such a state cannot be used to violate any Bell inequality [40]), i.e., the joint 
distribution can be written as a probabilistic mixture of factorizing local distributions, 

p{o^,oB,---\s'^,sB,--- ,p^B) ^ p{A) p(o^\s^,A) p(oB\sB,A)--- , 

.1 


( 53 ) 
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where A is some variable with a probability distribution p(A), ■ ■ ■ are the local measurement settings (each corresponding 

to a specific local POVM measurement {Mj}j^o'^, ■ ■ ■), and o^, o®, ■ ■ ■ are their outcomes (corresponding to i, j, ■ ■ ■ 

in the expression (52)). A separable quantum state is one for which each of the local distributions A), p(o^\s^. A), ■ ■ ■ in 

Eq. (52) itself can be thought of as arising from the respective local measurement being applied on a local quantum state, which 
means that the density matrix of the state can be written 

■■■ = 2 ® ■ ■ ■ ■ 

A 

A separable quantum state is clearly Bell local, but the reverse is known not to be true [41]. The relation between causal (16) 
and causally separable (51) bipartite quantum processes can be seen in an analogous way - a causally separable process is one 
for which the processes into which we decompose the process are themselves valid quantum processes. 

Here, we propose to extend the notion of causal separability to the multipartite case based on this analogy. 


Definition III.l. (Causally separable quantum process): A quantum process is called causally separable if and only if it can 
be decomposed in the canonical form given by Theorem II.2, with the additional condition that each process on the right-hand 
side of Eq. (41) is a quantum process. (Note that since the canonical form is defined iteratively, the latter is understood to hold 
for all conditional processes in this definition.) 

By a direct analogy, causally separable processes can be defined for any theory formulated in the process framework, but here 
we will be interested specifically in quantum processes. The process matrix of a causally separable quantum process will be 
called a causally separable process matrix. 


C. Non-equivalence between causal and causally separable multipartite processes: a tripartite example 

We now give an example of a tripartite quantum process that is causal but causally non-separable, which demonstrates that 
these two concepts are not equivalent, at least in the case of more that two parties. A similar conclusion based on the same 
example has been obtained independently by Costa and is presented in Ref. [19]. 

The example is inspired by the idea of superposition of causally ordered quantum circuits by means of the so-called quantum 
switch technique [3], where the order of two black-box quantum operations is made to depend on the value of a quantum control 
bit prepared in superposition of the two possible logical values. Each of the input and output systems of Alice and Bob in our 
example will be assumed to be a two-dimensional (qubit) system. We can imagine that this is the spin degree of freedom of a 
spin-j particle, which enters each laboratory, interacts with the devices inside, and leaves. The particle could be prepared so 
as to go in superposition along two different possible paths - along one path, it goes first through Alice’s laboratory and then 
through Bob’s, whereas along the other path it goes first through Bob’s laboratory and then through Alice’s. Eor simplicity, we 
can imagine that the experiment is arranged in such a way that the particle would always go through Bob’s laboratory at a fixed 
time, but depending on the value of the control bit, it would go through Alice’s laboratory before or after that. It is assumed 
that independently of the time at which the system may go through Alice’ laboratory in a given run, Alice would apply the same 
operation on it. To understand the effect of such a setup, consider first the case in which Alice and Bob each apply a unitary 
operation on the system, Ua and Ub, respectively. Let us denote the Hilbert space of the control qubit (path degree of freedom) 
by '7/'’, and that of the system (spin degree of freedom) by If |0)‘’ corresponds to the path in which Alice is before Bob 
and 11)'’ to the path in which Bob is before Alice, if we initially prepare the particle in the state, say, = |'I')('I'|':^, where 
14 /)^^* = (alO)'’ H- j3\iy)\il,y, at the end it will be in the state p“ = |'I')<'T|“, where |'I')“ = a\0yWgW^\i/^y jS|l)'’t/|t/^|i/r>'. 
Now, if a third party, Charlie, performs an operation on the joint system '7/'' ®'7/® subsequently, he can distinguish this situation 
from a situation in which the order between the operations of Alice and Bob is conditioned on a classical bit (e.g., modeled 
by the initial state of the control qubit being in a ‘classical’ mixture of the two possible values, |ap|0)(0|’' -H |y6p|l)(l|^, instead 
of a coherent superposition) by performing a suitable measurement. In fact, it was shown in Ref [6] that by exploiting such 
a coherent strategy, Charlie can perfectly distinguish between a pair of unitaries and that commute or anticommute by 
using each of the unitaries only once, which is impossible if the order of the unitaries is conditioned on a classical bit. An 
experimental demonstration of this effect was recently reported in Ref. [17]. 

In the general case, the operations of Alice and Bob need not be unitary and may have different possible outcomes. Every 
such operation, however, can be seen as the result of a joint unitary on the input system and a local ancilla, such that the outcome 
remains stored on the local ancilla in a particular basis. Similarly, any local ‘choice’ of operation can be modeled by a larger 
unitary on all systems involved plus a local ancilla that carries the ‘choice’ variable. Thus, we can have Alice and Bob perform 
general operations in this setup by purifying their local operations to unitaries and deferring the reading of their outcomes to the 
end of the whole experiment. (Note that in order not to destroy the superposition, the whole experiments needs to be performed 
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coherently, which may be unrealistic for local operations performed by macroscopic devices, but is in principle compatible with 
standard quantum mechanics.) 

In our example, we will take a - fi - and we will assume, as described above, that Charlie can operate on both the path 
and spin degrees of freedom of the particle after it has interacted with Alice and Bob. In other words, Charlie’s input system 
will be four dimensional, and we will formally decompose it into two qubit subsystems, ® where and 

correspond to the path and spin degrees of freedom, respectively. Since Charlie operates last, we do not need to introduce 
a non-trivial output system for him, i.e., his output system will be assumed one-dimensional. The process matrix relating the 
local experiment of Alice, Bob, and Charlie in this setup can easily be obtained by describing the experiment in the form of a 
circuit in which Alice’s operation is represented by two controlled operations at two possible times, such that one of them would 
act nontrivially depending on the state of the control qubit (left diagram on Fig. 5). Using the CJ representation of the channels 
connecting the different boxes, we obtain 

'\^AiA2BiB2CiC2 _ |yj^^^yj^|AiA2BiB2CiC2^ 

where 

|W>a>A2'Bi«2CiC2 ^ (|0)C5|(/,)'4i|(i)+)^2Bi|q)+^B2q + |l)C;|^)iii|(D+)B2Ai|(j,+^A2q^/^2, (56) 

with = |00) -H 111). It can be verified that VT'^i^ 2 ^i^ 2 CiC 2 contains only allowed terms. This process matrix is a rank-one 
projector, and hence it cannot be written as a convex mixture of different process matrices. Therefore, if it is causally separable, 
it must be of one of the types or But each of these types of process matrices should permit no 

signaling from two of the parties to the third one (e.g., in the first case there can be no signaling from Alice and Bob to Charlie). 
However, the above process matrix permits signaling to any of the parties from some of the other parties. Indeed, to see that there 
can be signaling from Alice and Bob to Charlie, imagine that Alice and Bob choose to perform the unitary operations Ua and 
Ub- In this case, Charlie will receive the state [|0)‘'i(t/Bt/A|iA))‘'i -i- aU 3 ]^!^))^’']! V2, which can be different for different 

choices of the unitaries of Alice and Bob, and can therefore yield different probabilities for the outcomes of some measurement 
of Charlie. To see that we can have signaling from Alice to Bob or vice versa, notice first that there can be no signaling from 
Charlie to Alice and Bob (Charlie has a trivial output system). This means that we have a well-defined reduced process for Alice 
and Bob, whose process matrix is 

^AiA2 B,B2 ^ ® |(P+ )((1)+|'^^®‘ ® 1®^ -H ® )((I)+|®2"'' ® 1^=). (57) 

This is a causally separable bipartite process matrix that can be interpreted as describing an equally weighted probabilistic 
mixture of two fixed-order processes - the first one describes a situation in which the input state |i/f) is sent into Alice’s input, 
her output is sent into Bob’s input through the identity channel, and Bob’s output is discarded; the second one describes the 
analogous situation with the roles of Alice and Bob interchanged. Clearly, since in the first situation there is an ideal channel 
from Alice to Bob, there can be signaling from Alice to Bob in this process (even if imperfect on average), and similarly from 
Bob to Alice. Therefore, the process matrix given by Eqs. (55) and (56) is not causally separable. 

The fact that the process is causal follows immediately from the fact that the reduced process for Alice and Bob is causally 
separable (and hence also causal). Specifically, we have ° '^[Af ° 

But the tripartite process is simply = ^c|ab „ ^ab ^ i^c|ab „ „ ^a ^^lich 

is the form of a causal process. This observation suggests how the probabilities of Alice, Bob, and Charlie can be simulated 
without using a quantum switch, if we allow the parties to have larger input and output systems. Since the reduced probabilities 
of Alice and Bob can be realized by conditioning their order on a classical random bit, all that is needed in order for the tripartite 
process to be reproduced in this way is for Charlie to receive the information about the settings and outcomes of Alice and Bob 
so as to produce the necessary p{o^\s'^, o'^, i®, o®, s^). Therefore, if in addition to the qubit system that goes between Alice and 
Bob there is another (possibly infinite-dimensional) system on which each party writes down his/her setting and outcome (right 
diagram on Fig. 5), and this system at the end enters Charlie’s laboratory (or, alternatively, the state on Charlie’s original input 
system is prepared based on this information), the process can be simulated using classically random causal configurations. 

By a similar argument we can construct a large class of multipartite processes that are causal but not causally separable. 
Consider a situation in which the order of all but one of the parties is conditioned on the state of a control system prepared in 
superposition, and subsequently all systems on which these parties have operated together with the control system are sent into 
the input of the last party. If all systems were initially prepared in a pure state and all channels are unitary ones, the process 
matrix will have rank 1, and unless the process is fixed-order causal, it cannot be causally separable. Yet, it will be causal 
because the reduced process for all parties except for the last one will be causally separable (and hence causal) due to the fact 
that when we trace out the control system, the process for these parties would be a classical probabilistic mixture of fixed-order 
processes. Since the full process is obtained by multiplying the conditional process of the last party with the reduced process of 
the previous ones, the full process is causal. It can be simulated using classical control of the order of the parties by allowing 
larger input and output systems by which the settings and outcomes of all other parties are made available to the last one. 
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FIG. 5; The left diagram illustrates the circuit with quantum control. The right diagram illustrates a simulation of the same 
correlations with a classically controlled circuit using input and output systems of larger dimensions. 


D. Non-causality can be activated by shared entanglement 

We now show another peculiar property of the concepts of causality and causal separability of quantum processes. One of the 
key assumptions in the derivation of the quantum process matrix framework is that every process can be extended by supplying 
the parties with ancillary input systems in an arbitrary quantum state, yielding another valid process. Intuitively, since a joint 
input state is a non-signaling process that is compatible with any causal configuration, one may expect that by adding such a 
state to a causal quantum process would yield again a causal process. We now show that this is not the case. We refer to this 
effect as activation of non-causality. 

We give a particular example of a tripartite causal quantum process matrix, constructed on the basis of the bipartite process 
matrix presented in Ref [4], 


4 V2 " V2 

which itself can violate a causal inequality and is hence non-causal (see Ref. [4]). Here, the input and output systems of Alice 
and Bob are two-level systems. In our tripartite construction, the input and output systems of Alice and Bob are also two-level 
systems, and we add Charlie, who has a trivial input system and a two-level output system. In terms of the Pauli matrices cr^, 
CTy, cTj, the process matrix we consider has the form 


T^AiA2B^B2C2 ^ 1(|A.A2B.B2C2 _L(jAI(j-B,^B2^C2 + _L^2^B2^C2y 

4 V2 " " " ^|2 " " ^ 


(59) 


The fact that this is a valid process matrix follows from the fact that it has the right normalization, contains only allowed cr 
terms, and is positive semidefinite. The latter is easy to see by noticing that relative to the {|0), 11)) basis of system Bi (this is the 
eigenbasis of cr^ corresponding to eigenvalues H-1 and -1, respectively), the process matrix can be written 


W‘ 


'A\A2B\B2C2 _ 


|0><0|*> ® ^(11 


A\A 2 B 2 C 2 


1 


cr^'crf^cr^^ + 


H- |1)<1|^‘ ® i(]l^i^2«^C2 

V2 4 


1 




V2 

(60) 


Now, the operator i(]i'^i'^ 2 B 2 C 2 ^^M^B 2 ^C 2 ^^A 2 ^C 2 ^ identical to that in Eq. (58) except that we have the system C 2 

in the place of Bi, and this operator has been shown to be positive semidefinite. The operator ‘crf^cr^^ - 

^CTj ^cr^-) differs only by the fact that the nontrivial cr terms come with a minus sign, and can be obtained from the first operator 

by a unitary transformation (e.g., one that takes to -cr^^' and to - 0 -^% such as CTy*). 

To see that this process matrix describes a causally separable process, note that it permits no signaling from Alice and Bob 
to Charlie, i.e., it can be formally written as ^ conditionally on any event in Charlie’s laboratory. 
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which is most generally described by some CP map with CJ operator > 0, Alice and Bob are left with a bipartite process 
with process matrix 

= TrcJCM*^^ (8> (61) 

This process matrix is obviously a linear combination of the identity and terms containing only cr^ operators on different sub¬ 
systems, i.e., it is diagonal in a given local basis (the {|0),|1)) basis for each subsystem). It was shown in Ref. [4] that all 
such bipartite process matrices are causally separable (though we remark that the same was shown not to hold for multipartite 
processes [9]). 

Imagine now that we supply Bob and Charlie with the entangled input state 2 |<l)^)(<I>^|‘'i*i, which yields the new process 

^AiA2SiSjfi2CjC2 _ y^A\A2B\B2C2 I (62) 

If Charlie performs the identity unitary channel from C'j to C 2 in his laboratory, which is described by 
Alice and Bob are left with the bipartite process 

^A,A2B,B\B2 ^ 1(|A,A2BiB;B2 + J_o^1o-Bio-^'vB 2 + J_cr^2^Bi (53) 

4 V2 " V2 

This can be easily seen from the fact that taking the partial trace of jjjg operator |(I)+)((l)+|‘''i ‘'2 is formally 

identical (up to a normalization) to a local projection in a quantum-state teleportation protocol [38], which amounts to ‘teleport¬ 
ing’ the part of the matrix on C 2 onto B\. (Note that the standard notion of teleportation is defined for quantum states and not 
process matrices, and the protocol requires a correcting operation on the receiver’s side since a projection of the kind above, 
which does not require correction, cannot be accomplished deterministically [38]). The process matrix (63) is similar to (58), 
except that the local operators on B\ in the non-trivial sigma terms in Eq. (58) are now on B\, and there is a cr^ operator on B\ 
in each such term. This process matrix is non-causal, because it allows Alice and Bob to obtain any correlations that they could 
obtain using the non-causal process matrix (58). This can be done as follows. Alice always performs the same operations that 
she would perform with the process matrix (58). Bob performs a measurement on system Bi in the (|0), 11)] basis. If he obtains 
the outcome |0), then it is as if Alice and Bob share the process matrix (58) with B\ in the place of Bi. He will then apply any 
operation from to B 2 that he would apply from B^ to B 2 with the process matrix (58), which yields the same joint probabilities 
for Alice and Bob as those with the process matrix (58). If Bob obtains the outcome |1) for his measurement on Bi, then it is 
as if Alice and Bob share the same process matrix as (58) with Bj in the place of B\ but with a minus sign in front of each of 

the two nontrivial cr terms. This process matrix is equivalent to the previous one under a change of basis by the unitary (Ty\ 
Therefore, Bob can simply apply from Bj to B 2 the same operations he would apply from Bi to B 2 with the process matrix (58) 

but transformed by the unitary transformation erf'. Again, this yields the same joint probabilities for Alice and Bob as with the 
process matrix (58). In particular, Alice and Bob can use this strategy to violate the causal inequality described in Ref. [4]. The 
process matrix (63) is thus non-causal, and so is the tripartite process matrix (62). 

It is not known at present whether non-causal processes can be realized in agreement with the known laws of quantum 
mechanics without resorting to post-selection. We have seen in the previous subsection that we can realize causally non-separable 
processes, which are nevertheless causal. Here, we see that certain causal processes can become non-causal when supplied with 
shared entanglement. The ability to extend a process with shared entanglement seems natural to expect for any experimentally 
realizable process. From this perspective, this result suggests that either non-causal processes may be possible, or that there may 
exist causally separable processes, as defined above, that cannot be realized in practice. 


E. Extensibly causal and extensibly causally separable quantum processes 

The fact that according to our definition of causal separability there exist causal processes that may be activated to non-causal 
ones by shared entanglement naturally suggests the definition of the following classes of processes that do not have this 
counterintuitive property. 


Definition III.2. (Extensibly causal quantum process): A quantum process that is causal and remains causal under extension 
with input systems in an arbitrary joint quantum state is called extensibly causal. 


Definition III.3. (Extensibly causally separable (ECS) quantum process): A quantum process that is causally separable and 
remains causally separable under extension with input systems in an arbitrary joint quantum state is called extensibly causally 
separable (ECS). 
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The process matrices of these types of processes will also be referred to as extensibly causal and ECS process matrices, 
respectively. 

Note. These definitions can be formulated analogously for more general process theories that permit composite local systems. 

Do these classes of processes correspond to something easy to describe in practice, and are they different at all? It is 
immediate to see the following facts. 

Observation 1: All bipartite causally separable processes are ECS. This is because, if we add an arbitrary joint input ancilla 
to a process matrix of the form (51), we again obtain a process matrix of the same form. Therefore, the notion of extensible 
causal separability can be seen as another possible multipartite extension of the bipartite notion of causal separability, which, 
however, is linked in a less direct way to the theory-independent notion of causality. 

Observation 2: Extensibly causal and ECS processes are not equivalent in general. Indeed, the causally non-separable 
tripartite process (55) based on the quantum switch is also extensibly causal (our proof that it is causal applies also if the parties 
share entangled input ancillas). 

Comment: Recently, Eeix, Araujo, and Brukner gave an example of a bipartite quantum process that is causal but not 
extensibly causal [39], proving that causality and extensible causality are different in the bipartite case too. While in the 
tripartite case we have seen that extensible causality is also different from causal separability, it is currently an open problem 
whether the same holds in the bipartite case. 

In the next subsection, we derive a characterization o f the tripartite ECS processes in terms of conditions on the form of the 
process matrix which generalize the conditions in the bipartite case (Eqs. (50), (51)). 


F. Structure of tripartite ECS process matrices 

Recalling the definition of causally separable process, let us first state an obvious consequence of this definition for the 
structure of causally separable (though not necessarily ECS) process matrices. Since the probabilities of a quantum process are 
linear in the process matrix, the requirement that a causally separable process decomposes as in Theorem II.2 where all processes 
on the right-hand side of Eq. (41) are valid quantum processes means that a causally separable process matrix is one that can be 
written in the form 


/=1 f=l 

where - ,n)ii ^ process matrix which describes a process with the property 

- .'-i.'+i. - .nUi - TE'; ' ' '"I' o TE', (65) 

where for n > 1 the conditional process ■ • ■ "I' jg ^ causally separable process for every value of the event in /, and for 

n = 1 it is the trivial process. Note that the requirement that • is a quantum process that permits no signaling 

from the rest of the parties to i guarantees that both the reduced and the conditional process on the right-hand side of Eq. (65) 
are valid quantum processes (this can be seen from the (no) signaling condition in Proposition III.2). 

In the case of two parties, we have seen that the process matrices whose processes obey o'W^ (note that 

any monopartite process is trivially causally separable and ECS), are those that can be written in the form <S> ll'^^ 

and the general form of bipartite causally separable process matrices is (51). As noted already, this is also the general form of 
the bipartite ECS process matrices. Our goal is to obtain a similar conditionfor triparite ECS processes. 

Eirst, let us consider a process of the form o 'W^, where is a monopartite quantum process and 

is a bipartite conditional process which is causally separable for each possible event in C. Since in particular there 
should be no signaling from Alice and Bob to Charlie in such a process, its process matrix, which we will denote 
can at most contain the types of terms listed in Table III. These are the terms that do not permit signaling from Alice and Bob to 
Charlie according to Proposition III.2. 

We will first obtain necessary and sufficient conditions for such a process to be ECS. Note that we have not proven yet that 
a general ECS process matrix should have the form (64) where each of the terms is itself ECS. This will be 

shown later. 
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Every event in Charlie’s laboratory is described by some CP map with CJ operator > 0, < t/cr Conditionally 

on such an event, Alice and Bob are left with the process matrix 


■^A\A 2 B\B 2 


TtV ^ \TM^i^2BiB2CiC2 


(|AiA2B,B2 


( 66 ) 


where is the probability for the event to occur in Carlie’s laboratory (given the appropriate setting), which 

is independent of the operations performed by Alice and Bob since the process involves no signaling from Alice and Bob to 
Charlie. More specifically. 


^(^c,C2) ^ Tr[lE‘^‘‘^W‘^‘^^], 


(67) 


where 


CIa 2 ^B-> 


( 68 ) 


is the reduced process of Charlie. The requirement that the conditional process for Alice and Bob is causally separable means 
that for all 


^AiA2itli?2 

'^VciC2 






A^B 




(1 - qMC.C2)Wlf. 


(69) 


where valid quantum processes compatible with A^B and BiA, respectively, and qMC\C 2 e [0,1] (all 

objects generally depend on For convenience, we will write this simply in the form 


■^AxAiBiBi 






(70) 


~ A B B ~ A A B 

where 1 E^C|C 2 ^ - ^^d lE^c,c 2 ' ^ 0 , and the whole operator is a valid process matrix, i.e., it contains only allowed terms and is 

properly normalized. 

A sufficient condition for this to hold is that 


■^A\A 2 B\B 2 C\C 2 

{A,B):^C 


0 i^A\B\B2C\C2 _|_ ■^B2 0 ■^A\A2B\C\C2 


(71) 


where t^^i^i* 2 CiC 2 > q ^AiA 2 BiCiC 2 > q ^j.g some positive semidefinite operators, whose sum gives a properly normalized 
quantum process matrix containing only the types of terms listed in Table III. (We remark that each of > 0 and 

ITA 1 A 2 B 1 C 1 C 2 > Q contain terms that are forbidden in a process matrix, such as terms of type C 2 , but these terms have to 
cancel in the sum.) Indeed, we have 


Y^CiC2 > 


+ 


® W' 


'A\A 2 B\ 

M^1^2 


(72) 


where 


= Trc,c2[1^''‘^'*-‘^‘'^'(1'^'^'*- ®'^‘^'‘^')]/e(^‘^‘‘^') > 0, 


(73) 




(74) 


A A B B C C A A B B 

and it is easy to see that since ^ contains only the types of terms listed in Table III, ’ can only contain 


allowed terms. 

It is immediate to see that this condition is sufficient also for the process matrix W‘ 


to be ECS. This is because if 

WAiA 2 ^i^ 2 CiC 2 jjjg above properties, any extension vT'^iABiB 2 CiC 2 ^p'^'iB\c\^ where is a density matrix, also has these 


Aj A 2 fii B2C] C2 
(A,B)^C 


properties. 

We now show that the form (71) is also a necessary condition for an ECS process matrix compatible with (A,B)^C, which 
we will denote by The proof makes use of the ‘teleportation’ technique that we used in showing the activation 

of non-causality. Imagine that we supply Alice and Charlie respectively with ancillary systems Aj and C[ of dimension t/ci'^C 2 
each, which are prepared in the maximally entangled state /((ic,^fc 2)2 where Conditionally 

on Charlie performing a suitable operation and obtaining an outcome with CP map | 0 +)( 0 +|(‘^iC’ 2 )c;^ Alice and Bob 

will be left sharing a process matrix which, up to a normalization factor, has an identical form to that of but with 



26 


Aj in the place of C 1 C 2 . The requirement that this is a causally separable bipartite process matrix means that must 

be of the form (71). 

So far, we have only obtained necessary and sufficient conditions for an ECS process matrix compatible with 

(A, B)iC (and similarly for permutations of A, B, C). We next prove the general case. 


Proposition III.3. Every tripartite ECS process matrix can be written in the form 


w: 


1,2,3, (75) 


where contains only terms from table III and has the form (71), and analogously for and 

by permutation. The Proof S6 is given in the Appendix. 


The extension of this form to an arbitrary number of parties is left for future investigation. 


G. Processes realizable by classically controlled quantum circuits 

Bipartite ECS processes have a clear experimental realization. This raises the question of whether multipartite ECS processes 
can also be realized in practice, and if so, whether they correspond to a natural class of experimental procedures. (Note that in the 
bipartite case, ECS processes are equivalent to causally separable processes, but we have already seen that there are multipartite 
causally separable processes that can become non-causal under extension with entangled ancillas, and these do not have a known 
experimental realization.) Here, we will show that a particular class of processes which can be realized in practice, referred to 
as classically controlled quantum circuits, belong to the class of ECS processes, which is the smallest class of causal quantum 
processes that we have considered so far. Based on certain considerations, we furthermore conjecture that all ECS processes can 
be realized in this way (this is certainly true in the bipartite case). 

The idea of a classically controlled circuit can be thought of as falling within the paradigm of quantum lambda calculus with 
classical control [42, 43]. If we regard the local experiments of the parties as black-box operations, we may think that they 
are called, only once each, as part of a computation where at every time step a quantum operation is applied on some part of 
a quantum register depending on a classical protocol that may use as a variable the outcomes of past operations. If black-box 
operations are involved in such a computation, their outcomes cannot be directly used (they remain ‘inside the box’ until the 
end), but the order of subsequent operations of the circuit may nevertheless depend indirectly on the event inside such a black 
box, since it can be decided based on a measurement on the output system. 

More concretely, we define such a process to have the following general realization. We begin with some sufficiently large 
quantum system (or ‘register’) in a given quantum state. We perform a quantum operation on it and conditionally on the outcome 
of that operation we determine which party will be first, which subsystem of the register will be his/her input system, and what 
operation will be applied after the black box of that party, all according to some specified rule. We apply the black-box operation 
of the first party on the decided subsystem, perform the decided operation after it, and depending on its outcome and the outcome 
of our first operation decide which party will be second, and so on. This continues until all parties are called (by definition, the 
protocol is such that each party is called exactly once). This model can be formalized in different equivalent ways, which may 
be suitable for different purposes, and we will consider some simplifications below when we discuss a tripartite example. The 
fact that this model gives rise to valid quantum processes can be seen from the fact that if we formally write the operation inside 
each box and calculate the joint probabilities for the outcomes of all boxes using the standard rules of quantum mechanics for 
all possible outcomes of the protocol, we see that they are linear and non-contextual functions of the respective CP maps of the 
parties. The same holds if we introduce ancillary systems prepared in an arbitrary state and consider extended operations of the 
parties that act on parts of them. 

In the case of only two parties, we know that any (extensibly) causally separable process can be implemented in this way, 
since it most generally corresponds to embedding at random the local experiments of Alice and Bob into one of two possible 
fixed circuits, which can be chosen conditionally on the outcome of a measurement on some state at the very beginning. Since 
after the first party is chosen there is only one possible choice for the second party, no measurement after the first party is needed. 
Reversely, any bipartite process that we may obtain via this model has the form of an ECS process. Eist notice that the process is 
independent of the operation applied after the last party. Also, the outcome of any operation after the first party can be ignored 
since there is only one choice for the last party, i.e., that operation can be assumed deterministic. Einally, the outcomes of the 
operation before the first party can be grouped into two coarse-grained outcomes such that conditionally on one of them the 
first party is Alice and on the other one it is Bob. But since after the outcome of that operation and before the input of the first 
party the quantum register is in some particular quantum state, the rest of the experiment simply corresponds to a deterministic 
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circuit in which Alice and Bob are embedded in a particular order. Therefore, the process realized by such a procedure is just a 
probabilistic mixture of the processes of two fixed-order circuits, which is the claimed form. 

In the case of more than two parties, the equivalence between the two concepts is less obvious, but we can easily argue 
that all processes obtained by classically controlled circuits are ECS. First, it is clear that depending on the outcome of the first 
measurement (which has a probability independent of any future operations and therefore of the settings of the parties), there will 
be one party that is first and hence the subsequent process that results from the protocol can involve no signaling from the rest of 
the parties to that first party. Therefore, the subsequent process has a well-defined reduced process for the first party. Taking into 
account all possible outcomes of the first measurement, the whole process will be just a probabilistic mixture of processes of 
this kind where one party is first, which is Eq. (64). But conditionally on the outcome of the first party, the procedure for the rest 
of the parties looks analogously, so Eq. (65) holds too, i.e., the process is causally separable. Including ancillas onto which the 
operations of the parties can be extended does not change anything in this argument. Therefore, every process realizable with a 
classically controlled quantum circuit is ECS. 

We conjecture that the reverse also holds. We provide some partial considerations that support this conjecture, based on 
analysis of the restrictions on the allowed terms in processes realized by classically controlled quantum circuits in the tripartite 
case. We will focus on the question of implementing by a classically controlled quantum circuit an ECS process matrix of 
the type which has the form (71). Implementability of a matrix of this kind is both necessary and sufficient for 

the implementability of a general tripartite ECS process matrix as described in Proposition III.3, since by using a suitable 
measurement at the beginning we can select with the right probability which of the three process matrices in the mixture on the 
right-hand side of Eq. (75) to realize subsequently. 



FIG. 6; Realization of an ECS process compatible with (A, B):^C by a classically controlled quantum circuit. 


The protocol begins most generally with some quantum system prepared in a state p. After Charlie operates on some sub¬ 
system, we apply some operation based on whose outcome we determine who is second, on what subsystem he/she would act, 
and what operation will be applied after that. Note that without loss of generality we may assume that there is a pre-specified 
subsystem on which the second party will operate since any subsystem of the same dimension can be mapped onto the desig¬ 
nated subsystem by a unitary transformation that can be absorbed as part of the definition of the present operation. Also, without 
loss of generality we may assume that this operation has only two outcomes, since we can group the outcomes into those for 
which Alice will be next, and those for which Bob will be next, and any conditioning of the operation following the next party 
on the fine-grained outcome within each group can be equivalently done by a single future operation acting on a larger system 
that includes some subsystem on which the classical information about the outcome at this step is copied (still something that 
we can include as part of the definition of the operation at this step). Since there is only a single possibility for the last party, 
the operation after the second party can be regarded as a deterministic operation (or a CPTP map) from all systems to the input 
of the last party. We leave the possibility that this last operation may be defined conditionally on the first outcome rather than 
absorb the conditioning on that outcome into a larger operation, in order to avoid complications arising from the fact that the 
different parties may have input and output systems of different dimensions. The outlined procedure is sketched in Fig. 6, where 
the two possible sequences of transformations arising from the two possible outcomes of our first operation are depicted in blue 
and green, respectively. The two CP maps corresponding to the outcomes of the operation after Charlie must sum up to a CPTP 
map, since they correspond to the two possible outcomes of a standard quantum operation. 

Each of the two possible developments (blue and green) of this protocol is a non-deterministic linear supermap [44] from 
the local CP maps of the parties into the real numbers, the result of which equals the probability for the particular sequence of 
events. This can be written in a similar form as the formula for the probabilities of the outcomes of the parties in a valid process. 
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~ A A B B C C 

except that in the place of the process matrix we would have an operator IV. ‘ ^ ‘ ^ > 0 , where i = 1 , 2 , labels the particular 

development, which generally would not be a valid process matrix. However, + ^MA2BiB2CiC2 _ ^Ai^BiftCiC2 

would be a valid process matrix realized through this classically controlled quantum circuit. 

Consider now just one of the two possible developments, say, the blue one, in which Alice is second and Bob is last (labeled by 
1 ). One can see that since Bob is last and his output system is discarded, we have Ty^iABiB2CiC2 _ ^B2 ^ ^AiA2BiCiC 2 
in the other case we have t^^1"^2BiB2CiC2 _ |A2 ^ i^MBiB2CiCiy jf j-jjg transformation Ni^cptp after Alice was not 

required to be CPTP but could be any CP map Ni^cp, for a suitable choice of the initial state p and of the CP maps Mxpp and 
N\pp we could realize any > q xhis is simply because we can choose the density operator proportional to 

^AiA2BiCiC 2^ where the part of on A1A2B1C2 is stored on C, and we can ‘teleport’ this part of the operator onto its 

desired subsystem by using CP maps Adi,cp and Afi,cp that have CJ operators proportional to projectors on maximally entangled 
states as needed to realize the ‘teleportation’ (the traces of these CP maps can be chosen to ensure the overall trace of the 
resultant operator However, the restriction that the transformation after Alice is trace-preserving, Afi,cppp, places 

constraints on what kind of obtained. Indeed, the CJ operator of Nipppp cannot contain terms of type A2, 

A'A2, and A'. Considering the calculation of qjj jj^g (^j operators of p, Mipp and N\pp, we see that the 

lack of these types of terms in Ni^cptp implies the lack of any term with a nontrivial cr on A2 in This is the only 

constraint on the possible types of terms in tx^i'‘'2BiCiC 2 possible types of terms are exactly those allowed in the operator 
IVA1A2B1C1C2 jjj £q Similarly, we see that the allowed terms in pp^i^i®2CiC2 (gob second, Alice last) are the same as those 

in tX'^i^i^2CiC 2 jjj £q These are the terms allowed in a process matrix compatible with Charlie being first, except that 

both jp^i'‘'2BiCiC2 (yA,PiB2CiC2 contain terms of type C2 and C1C2 . The fact that these terms should cancel in the sum 
® (yAiA2BiCiC2 |A2 igj ^AiBiP2CiC 2 _ ^Ai^Bi^B2^iC2 fogows fi'om the fact that this is a valid ECS process, and can be seen 
to be ensured by the requirement that M\pp + M2,cp is CPTP. 

The only restriction on the operators and imposed by this model, apart from their positive- 

semidefiniteness and the normalization of their sum, seems to be the absence of the forbidden terms in each of them, as well as 
of the forbidden terms in their sum. If this is indeed the case, then any ECS process could be realized by a suitable classically 
controlled quantum circuit. A strictly rigorous proof requires showing that apart from the lack of these forbidden terms, there can 
be no other hidden constraints on the pair of operators 1 * (g ^a,A2B,CiC2 |A2 ^ ^MB,B2CiC2 gf course, are guaranteed 

to be properly normalized). One way of doing it could be by exhibiting an explicit constructive procedure for implementing any 
given ECS process, which would be of additional interest on its own right. We leave this question, and the multipartite case, for 
future investigation. 


IV. CONCLUSION 

In this paper, we proposed a rigorous definition of causality in the process framework [ 4 ], which takes into account the fact that 
the causal order between a set of local experiments may in general be random and correlated with the settings of some of them. 
We derived the stracture of causal processes permitting such ‘dynamical’ causal order in the general multipartite case, which is 
captured by an iteratively formulated canonical form expressed in terms of reduced and conditional processes. The canonical 
form can be interpreted as an unraveling of the process into a sequence of local experiments, which agrees with the condition 
that the order and outcomes of the experiments prior to a given step is independent of the settings of future experiments. We 
showed that for any fixed number of settings and outcomes for each party, the probabilities of a causal processes form a polytope, 
referred to as the causal polytope. The facets of this polytope define causal inequalities, whose violation by a given process can 
be interpreted as demonstrating the non-existence of causal order between the local experiments. 

We investigated this concept and the related concept of causal separability in the quantum process theory introduced in Ref [ 4 ], 
whose properties were detailed here in the multipartite case. We proposed a definition of causal separability, which reduces to 
the one for the case of two parties [ 4 ], based on the canonical form of causal processes. Specifically, a causally separable 
quantum process was defined as a causal quantum process that has a causal decomposition such that the different processes 
appearing in this decomposition are themselves valid quantum processes. We showed that the set of causally separable quantum 
processes is strictly within the set of causal quantum processes, by exhibiting an example of a tripartite process that is causal but 
not causally separable. Very recently, the same was shown to hold also in the bipartite case [ 39 ]. We also gave an example of 
a causally separable (and hence also causal) process that becomes non-causal when extended by supplying the parties with an 
entangled ancillary state. Based on this observation, we proposed two extended notions of causality and causal separability called 
extensible causality and extensible causal separability, which require preservation of the respective property under extending the 
process with entangled input ancillas. Although they are different in the general case, the sets of causally separable and ECS 
processes are equivalent in the bipartite case. We showed that the sets of extensibly causal and causally separable processes 
are different in general via the same tripartite example that we used to show that causal and causally separable processes are 
different. At present we do not know if the same separation holds in the bipartite case. However, it was recently shown that 
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FIG. 7: A Venn diagrammatic sketch of our present knowledge of the different sets of quantum processes that we have 
introduced, in the general multipartite case and in the bipartite case. The white segments are non-empty. The gray segments are 

sets for which at present we do not know if they are empty or not. 


causal and extensibly causal processes are different in the bipartite case, similarly to the multipartite case [39]. 

Finally, we derived a simple characterization of the ECS quantum processes in the tripartite case in terms of conditions on the 
form of their process matrices, which extends the conditions for (extensibly) causally separable process matrices in the bipartite 
case. We conjectured that the set of ECS processes is equivalent to the processes that can be obtained within the paradigm of 
classically controlled quantum circuits and provided evidence for this based on analysis of the restrictions that this paradigm 
imposes on the tripartite process matrices it can create. The ECS processes and the processes obtainable by classically controlled 
quantum circuits are equivalent in the bipartite case. 

Our present understanding of the relation between all these different classes of quantum processes is illustrated for the general 
multipartite case and for the bipartite case in Eig. 7a and Eig. 7b, respectively. An obvious open problem is whether the gray 
segments in these figures are empty or not. 

Another problem of fundamental importance is to understand the class of quantum processes that are physically admissible 
in agreement with the known laws of quantum mechanics, and where this class stands with respect to all of the above classes. 
Are the processes that can be realized by classically controlled quantum circuits all the physically admissible causally separable 
processes? Where does the class of quantum-controlled quantum circuits stand? At present, this is the most general operationally 
feasible paradigm that we are aware of and all known processes realizable through it seem to be extensibly causal. Could the 
class of extensibly causal processes be equivalent to quantum-controlled quantum circuits? And most intriguingly, are there 
physically admissible non-causal processes? 

The implications of our results are not limited to the subject of indefinite causal order in quantum mechanics. They can 
be useful also for the problem of inferring causal structure [24], both in classical and quantum theory [45]. The subject of 
causal inference concerns many disciplines, from philosophy and machine learning to sociology and medicine. Our formulation 
of a background-independent operational notion of causality that admits dynamical causal relations opens the road to a more 
general paradigm for causal inference than the one assuming deterministic underlying variables and static causal relations [24]. 
The decomposition of causal processes derived here implies constraints on the possible causal orders compatible with given 
setting-outcome correlations, which can serve as a basis for developing more sophisticated causal inference tools. 
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Appendix: Causal and causally separable processes 


Proof SI. Proposition II.2. The ‘only if’ part is contained in the very Proposition II. 1. To prove the ‘if’ part, take an arbitrary 
experiment, say, 1 . Let {2, • • • , k}, up to relabeling, be the set of local experiments that are in the causal past or causal elsewhere 
of 1, and {k + 1, ■ • • ,n) be the set of local experiments that are in the causal future of 1. Since the causal configuration of the 
local experiments is assumed fixed, the condition for the process to be causal reduces to the requirement that for every such 1, 
we have p{o^, • • • , - ■ ■ , s") = p{o^, ■ ■ ■ , o''\s^, • • • , i"). But from the transitivity and anti-symmetry of causal order it 

follows that none of the experiments {1, • • • ,A:) is in the causal future of any of the experiments {k + 1, • • • ,n). This implies 
that we have a reduced k-partite process for {1, ■■ ■ ,k}, i.e., p(o',-" , o^|s', ■ ,s") = p(o^,--- , s*). The desired 

condition then follows from Proposition II. 1 applied to the k-partite process. 

Proof S2. Proposition II.3. First, observe that the property (18) holds for the case where the specified K consecutive sets 
exhaust all local experiments {1, • ■ • ,n]. This is because, in this case, each of the local experiments in the K'* consecutive set is 
causally preceded by or causally independent from every other local experiment. Hence, the definition of causality (2) directly 
implies the desired relation. The general case follows by induction from this special case and the following Lemma. 

Lemma SI. Let the property (18) hold for K = K' H- 1, where K' > 1. Then it also holds for K = K'. 

Proof. Observe that 


P(k{Ii,--- ,nK'), [li,--- ■■■ ,[1 

>«K'),[li,-' - ,[1 k' 


K',--- , 0 «K' 


(SI) 




where the sum on the right-hand side is over all sets of local experiments that can be the (K' -H I)'* set when the first K' consecutive 
sets are the specified ones. If Eq. (18) holds for K = K' H-1, all terms in the sum can depend non-trivially only on the settings of 
the parties in the first K' consecutive sets, and hence the same must hold for the quantity on the left-hand side: 

piKih,--- ,nK'),[li,--- ,/) (S2) 

^ pWi,--- ,nK'), [li,-- - ,[1 kS"- 


What remains to be shown is that this probability cannot depend on the settings ^... ^ i"K' 

Note that here we cannot apply straightforwardly the causality condition (2) as we did in the case when the first K' consec¬ 
utive sets were assumed to contain all local experiments. This is because for a particular causal configuration /<'t(li, ■ ■ ■ ,nK') 
compatible with [Ij, • ■ • , ni]^ ■ ■ ■, [Iks ■ ■ • , hk']^ . it is generally not the case that 

P(k.(1i,"- ,nK'),[li,--- JIkS'" • • • , |i', • • • , i") 

= p(Af.(li, • • • ,nK')>o’‘. • ■ • , s"). (S3) 


Indeed, in order for the first K' consecutive sets to be the specified ones, it is necessary and sufficient that: 1) the local experiments 
in the specified K' consecutive sets have a causal configuration compatible with these sets, and 2) each of the local experiments 
that are not in the specified K' consecutive sets is in the causal future of at least one of the local experiments in the 
consecutive set. [In the case where the K' sets were assumed to contain all local experiments, only condition 1) was relevant 
and hence the equality (S3) held.] Consider a particular causal configuration Kt(li, • • • , n^j) compatible with [li, • • ■ , ni]^ • • •, 
[Ik', • • • , wk']^ (when the causal configuration /c(li, • • • .hk') in the probability on the left-hand side of Eq. (18) is not compatible 
with the specified consecutive sets, that probability is trivially zero). Let us denote by I rest,' " , Irest, I - n - 2k=i Ijm=i 
rest of the local experiments, i.e, those that do not belong to the assumed first K' consecutive sets. We have 

P(k.(1i,"- ,nK'),[li,--- .[Ik','" , o ^',■ ■ ■ , o'?"'^ 

= p(/C.(li,-" ,n^'),(\^'<\rest'^ ■■■ V n^<\rest),--- <lrest ^ <lrest), 0^\ ■ ■ ■ , | ^1 ^ . . . ^ ( 54 ) 


We will show that the probability on the right-hand side can be written as a linear combination of probabilities for which the 
condition of causality (2) straightforwardly implies independence of ,... , 

To this end, we write 


P(k.( 1 i, ,nK'),(l K' ^ 1 rest 

= P(k*(1i,--- ,nK'),o'',-" 


V ■■ ■ V nK'<lrcJ, - •• ,{'^K’<lrest V ■■ ■ V <lrest), ■ ■ ■ , 0 *K'|s'," • , s") 

,• •• , i") - p(/C.(li, • • • ilrest) V • ■ ■ V {'K' Hrest), 0^\ ■ ■ ■ - , i"), 
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where TC' = {Iks ■ ■ ■ ,hk')^ and iX means 1 k'^/ A ■ ■ ■ A 

By the definition of causality, the term p{Kt{li, ■ ■ ■ ,nK'),o^\ ■ ■ ■ , ■ ■ ■ , s’') on the right-hand side is in¬ 
dependent of • • • , i"K'. We need to prove that the second term is also independent of ^ ^«k' 

Observe that the proposition “{'K'ilrest) V V {‘K'ilrest)” is true when a proposition of the following kind is 
true; for some nonempty subset of {Irest, ■ ■ ■ Jrest), say, {Irest, ■ ■ ■ , frest), 1 < r < /, define the proposition 
“CK'ilrest) A ■ ■ ■ A ('K'iryest) A -'[C7C'^(r H- \)rest) V ■ ■ • V {'K'ilrest)T■ The different nonempty subsets of {1^^,, • • • , lyest) yield 
different such propositions that describe a complete set of mutually exclusive scenarios for which ‘X'K'iXyest) V ■ ■ ■ V CK'ilrest)” 
is true. Therefore, the probability ■ ■ ■ , mrOj i'K':^lrest) V ■ ■ ■ V {'K'ilrest), , o^’^' |s', ■ ■ ■ , s") is a sum of probabili¬ 
ties of the form p(a;,(1i,--- ,nK’),('K'ilrest),-■ ■ ,('K'irrest),^[('K'i(r + V ■■■ V ('K'ilrest)],o^\-■ ■ , s"), 

up to relabeling of lyest,"' Jrest, where 1 < r < f. But every such probability can be further written as 

/7(/C,(li,--- irrest). | • • • , Jf") - , nK')A'K'rest). " ' A"K' irrest)AA<' iA + 

V • ■ • V AK ^Irest). O ' ' ' , \s , ' ' ' , S ). By the definition of causality, the first of these terms is independent of 
/s+i)k' ,,, ^ i«K' Considering again the different realizations of “(‘7C'5<(r + V)rest) V ■■■ V i'K'ilrest)" by propositions of the 
form + \)rest) A ■ ■ ■ A {X<'i(r + q)rest) A -^[{X<'^{r + q + \)rest) V ■ ■ ■ V {'K'ilrestW for 1 < q < I - r (up to relabeling 

of the local experiments), the probability in the second term can again be written as a sum of probabilities of the form 
p(/c.(li,-" ,n^'),{X<'i\rest),- ■ ■ ,{'K'irrest),^[{'K'i(r+\)rest)y---y{'K'ilrest)],o^\--- , o«K'I*',''' , s"), where uow r is strictly 
larger than the one in the previous step. We can continue this for every new term until we reach r - l.ln this way, the probability 
p(/Ct(li,'-' , mkOj [li> ■ ■ ■ ■ • • , |s*, ■ • • , s") is decomposed entirely into a linear combination of 

probabilities of the form p(/c»(li, ■ ■ • , hk')^ i'K'i^rest), • • • , if rest), o'S • • • , ... ^ 1 < r < 1, which by the definition 

of causality are independent of ^... ^ i”K' -pjjjs completes the proof of Lemma SI. 

Proof S3. Theorem II.1. The necessity of the form (40) follows from Proposition II.3. Indeed, let and denote the 
collection of settings and outcomes, respectively, of the local experiments in a subset AI c <S. In terms of the set of parties X 
that are first, the probabilities of a causal process 'Wf can most generally be expanded 

p^(o^\s^) = ^ p([3(] V^)p(o'^k^, [Xf)p(o^^^\s^^^, o^, [3(]'). (S5) 

A’c.S.A’#!) 

But, as noted earlier. Proposition II.3 implies that p([<T]V'^) = p(['^]^). and that p(o'^\s'^, [/T]^) = p([(Y]\o‘''|i'^)/p([A’]*|i'^) = 
p([(Y]\ o^\s^)!pilX}^) - p^^{o^\s^, [/Y]*) are the probabilities of a non-signaling process for X. We therefore have 

^ o (S6) 

A'c.S,A',t|) 

where px - p([A’]^). Next, if ^ 4 >S, we can similarly expand the probabilities of the process in terms of the set of 

parties J/ that are second: 

xp(o'S\('YuJ/)|o-^, [<Y]’, [J/]“). (S7) 


Again, from Proposition II.3 we 
pim\ o^\s^)lp{[X]\o^\s^) 
p{[X]\ [J/]“, o^, o^\s-^)/p{[X]\ [J/]“, o'^li'5) 


have that p([J/]“|s'^,o'^, [-Y]') = p([A]^ [J/]“, o'^ls'^)/ p([A]', o'^ls'^) = 


p([J/]“k'^,o'^,[A]*). 


Similarly, p(o'^\s^, o^, [J/]^^) 




p(m\ o^,o^\s^, s^)/p([X]\ [J/]“, 


Pnsio- 


, o''^, [A]\ [J/]*^) are the probabilities of a non-signaling process for J/ for each value of (s'^,o^). In other 


words, we obtain that for each value of the events (s'^, o^) in X, has the form (S6). The argument is completely 

analogous for the next conditional process that appears, ^•s\(A’uj/)|.yuj/^ which, if nontrivial, can be expanded in terms of the 
different possibilities for the third consecutive set, and so on. This can be continued until we reach the last consecutive set in 
every possible grouping of the parties into consecutive sets, which proves the necessity of the form (40). 

To prove sufficiency, we will show that if every process of the form (40) is causal for !<«<«', then the same must hold for 
n = n' H- 1. The general case then follows by induction from this and the fact that a monopartite (n = 1) process, which has the 
form (40), is causal. Let an n'-partite process have the form (40), i.e., its probabilities can be written 


piK(S),0^\s^) = PxPnsi0^\s^)pi0' 


,S\Xi^S\X 






(S8) 


A’c.S.A’#!) 


where the probabilities p(o'^''^\s^''^, s^, o^) describe a conditional process which, if non-trivial, has an analogous form 

for every possible value of (s^, o^). Such a conditional process is therefore causal for every possible value of (s'^, o^) according 
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to our assumption. This means that there exists a probability distribution where k(S\X) is the 

causal configurations of the experiments in S\X, such that p(a'(>S\A’), s''‘, o''‘) = which 

for every fixed {s^, o^) obeys the causality condition (2). We want to show that there exists a distribution p(k(S), where 

k{S) is the causal configuration of all experiments S, such that 2,^(5) p{k{S), = p(o'^|i'^), which also obeys the causality 

condition (2). The following distribution will be shown to satisfy these desiderata: 

p(K(S),o^\s^) = 

for k(S) = W*S\<Y); i<j, V/ eX,Vje S\X-, iitj, Vi, j e X], 

p{K(S),o^\s^)^0 (S9) 

for k(S) + i<j, V/ eX^je S\X-, iitj, Vi, j e X]. 

According to this distribution, px - p{{X^), and the causal configuration of all local experiments for [A]* 

is always such that each of the local experiments in X is in the causal past of all local experiments in S\X, 

while the probability for the causal configuration and outcomes of S\X given the events in X and the set¬ 
tings in S\X is p(a:(.S\A’), i"'', o"^). The distribution /7(/c(kS), has the correct marginal p(o'^|i'^) 

by construction. To show that it satisfies condition (2), we will show that p(A-(kS),o'^|i'^, [A]*), which equals 

i‘^, 0 ''^) for k{S) - [K{S\X)\i<j,'ii e A, Vy e S\X;iMj,'^i,j e <^1 and vanishes oth¬ 
erwise, satisfies this condition. The fact that the whole mixture p(k(S),o'^\s^) = ['^1*) 

satisfies it then follows from the linearity of the condition. Consider a given local experiment / 6 S\X. Let 

X' c X and J/' c (S\X)\L We have p(K(X\J/',r)JiX',liJ/',o'^',o^'\s^) = p(k(X' ,J/', 0 ^^’,o^'\s^) = 
Pns(o'^'\s'^')p{K{^',l),li^',o‘^'\s‘,s‘^',s'^,o'^) for K(X',}f',l) - [/c(J/', Oi Vi e X','i j € J/' U j e X'], 

and p(K(X',if',l),l:i:X',mf',o‘^',o‘^'\s'^) = 0 otherwise. But from the fact that p(k(S\X),o'^^'^\s'^^^ , , 0 “^) satisfies 

condition (2), it follows that p{K(y,r),li}f',o'^’\s',s'^',s^,o^) - p(K(y,r),li}f',o'^'\s^',s^,o^). This proves that 
P(k{X' ,}/',r),liX' Jiif', 0 '^',o^'\s'^) - p(k(X' ,}/',l),liX',1:^', 0 '^'which is condition (2). Similarly, if we take 
I e X, consider two arbitrary subsets X' c X\l, J/' c S\X. When J/' V {), we have p(k(X' ,}/',l),liX' ,cX' ,o^'\s'^) - 0. 
When J/' = {), we have p(k(X' ,}/',l),liX', 0 “^',o'^'\s'^) - p(k(X' ,l),liX',o‘^'\s'^) - Pns(o‘^'\s‘^) - p„s{o‘^'\s'^'), which 
again proves that p{K(X',^',l),l^X',li^',o'^',o^'\s^) - p{K(X',^',r),[:^X',li^',o'^' i.e., we have seen that 
condition (2) is satisfied for every 1. This completes the proof of Theorem II. I . 


Proof S4. Proposition III.l. The proof follows the idea of the proof for the bipartite case in Ref. [S4]. Here, we detail it for 
the case of three parties. The «-partite follows analogously. 


Expanding the CJ operator of a local CP map in the Hilbert-Schmidt basis, r^v 6 R, we observe that 

the trace-preserving condition Tr^^M^'^- = is equivalent to the requirement roo = r,o = 0 for i > 0. Thus, CJ operators 
corresponding to CPTP maps are positive semidefinite operators of the form 




1 

dXn 




Xi X2 


i>0 


i,j>0 


Oi, tij e. 


(SIO) 


It turns out that condition (46) can be equivalently imposed only for operators of the form (SIO) without the constraint 

^XiX 2 > Q Qearly, an operator vT'^iABiB 2 CiC 2 - satisfies Eq. (46) for all operators of the form (SIO) satisfies 

Eq. (46) for positive semidefinite operators of this form in particular. The converse follows from the fact that any 

operator of the form (SIO) can be written as a real linear combination of positive semidefinite operators of the form 

(SIO): , where > 0 satisfy (SIO) for all i, and o'; G R, Vi. We will use this fact to recast 

condition Eq. (46) as a statement about the types of non-zero terms in the Hilbert-Schmidt expansion of - 

In the case of three parties, the expansion of vT'^ 1 '^ 2 BiB 2 CiC 2 j-gads 


y^A\A2BiB2ClC2 


z 


A] A2 Bi B2 Cl C? 

■ 'cr,V,,‘cr, 




Wijkimn e R-, Vi, j, k, I, m, n. 


(511) 

(512) 


Let us fix 


dA2 


and M^'^- 


^82 


, and 


consider an arbitrary of the form (SIO). Condition (46) becomes 


-q’j.[ypAiA2BiB2CiC2(-|AiA2 ig JB 1 B 2 g, (]lCiC2^ 

2 c„cr^^ + J] tr^^))] = 1, (S13) 

n>0 mn>0 
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TABLE SI: The types of terms that are forbidden in a tripartite process matrix 


C2 

C1C2 

B 2 

B2C2 

B2CIC2 

BIB2 

B[B 2 C 2 

B1B2C1C2 

A 2 

A2C2 

A2C1C2 

A2B2 

A2B2C2 

A2B2CIC2 

A2B1B2 

A2S1 B2C2 

A 2 B\B 2 C\C 2 

AIA2 

A1A2C2 

A1A2C1C2 

A1A2B2 

AIA2B2C2 

A1A2S2C1C2 

A1A2B1B2 

A\A 2 B[B 2 C 2 

A{A 2 B \B 2 C \C 2 




which, using the expansion of the process matrix, becomes 

dAidB^dcS'^Omm + ^ W00000«C„ + ^ W0000mnPm«) = 1, 


«>0 


mn>Q 


Pmn ^ 


(S14) 


This implies wqooooo 


d^^dB^dc^ 


and Wooooon = vvoooomn = 0, Vm, n > 0. 


ll A j A2 

Likewise, by fixing and - 


wooo /00 = woowoo = 0 for all k,l > 0, while by fixing M 


H c^c-) 

^—, and considering an arbitrary of the form (SIO), we obtain 

11 

B1B2 - J_ 


|SiB2 


dB2 


r r H A A 

and ^—, and considering an arbitrary M ^ ^ of 


the form (SIO), we obtain wojoooo = h'/voooo = 0 foi' all /, j > 0. 

-d A1A 2 

Now, if we fix only —, and we use the previously obtained constraints, we obtain wooo/o« = vvoowon = woootan = 

wookimn - 0 (each of these coefficients can be shown to vanish by suitably choosing the parameters in and in order 

R R 11 

to select only the term with that coefficient). Then, if we fix M ' - - ^—, we obtain wojooon = '^OjOOmn = '^ijcoon = WijtXmin - 0. 


d Cj C 2 

Similarly, if we fix ^ ^e obtain wojozoo = = w, 70/00 = ^, 7^,00 = 0. 

Finally, we impose condition (46) for arbitrary and of the form (SIO). Using the constraints obtained 

from the special cases above, we obtain wqjqiqh — ^ojoimn — ^ojkion — ^Ojkimn — ^ijoion — ^ijoimn — — Wijjilmn — 0. Thus, 

we have shown that all coefficients WijUmm except for woooooo^ that may appear in the result of taking the trace of VT'^i"^ 2 BiB 2 CiC 2 
with a general combination of of the form (SIO), must vanish. This is also a sufficient condition for the 

normalization condition (46) to hold. All these forbidden terms for a process matrix are listed in Table (SI). 


Proof S5. Proposition III.2. 

Explicitly, by the definition of (no) signaling ( 6 ) and the expression for the probabilities of a process in terms of the process 
matrix (44), there is no signaling from (1 and 2 and ■ ■ ■ and k) to (k + 1 and k + 2 and ■ ■ ■ and n) if and only if 




= Tr 


yplll2 -ni«2 ^^hl2 g, , , . g, ^^1*2 ^ ^(i+l),(i+l)2 Ig, , . , Ig, ^«in2 j 


= Tr[lT'^+'>4<r+i)2-«i«2 ^y^^(*+i).(/;+i)2^...^^«,«2^j = p(o^+i^... ,{M"..)), 


for all local quantum operations {A1',), ■ ■ ■ , {AdJJ,,), where M ' ^ M ^ V/. Here, the operator vp(*+i)id:+i )2 - nin 2 given 

by 


^(^+1)i(^+1)2'"«1M2 


Tr 


l,l 2 -i:ii:; 


IT 


Iil2-nin2 


d\, ■■■dk, 


(S15) 


which is obtained for the case where M = = I, ■■■ ,k. This condition is equivalent to the condition that 


Tr 


1 | l2—k\k2 


VTl'' 2 '">« 2 ^M'‘l 2 ®---®M*'^ 2 g,|(/:+l)l(/:+l) 2 -J ^ ^(i:+l)l(A:+l) 2 -ni« 2 ^ VM^‘, • • • , , (S16) 


where are the CJ operators of CPTP maps (this is because any linear operator y(*+i)i(^+i )2 - nin 2 f^JJy deter¬ 
mined by the values of Tr |^y(^+i)i(*:+') 2 - «i «2 ^;\^(k+\)i{k +\)2 0 ... 0 for all possible > Q, • • • , > 0). To 
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analyze the role of the different types of terms in satisfying or violating condition (S16), consider the representation of jy'i 
as a linear combination of Hilbert-Schmidt terms of different types and the contribution that each such term makes to the quantity 
on the left-hand side of Eq. (S16). Assume that contains only terms of the types stated in Proposition III.2. The iden¬ 
tity term is such a term. When the identity term is partially traced with any combination of local CPTP maps M ' ^ ■ ,M ' ^, 

it yields exactly the right-hand side of Eq. (S16). From the rest of the terms that satisfy the condition in Proposition III.2, we 
can distinguish two types. The first type are those that have a nontrivial cr operator on i\ and 1 on i 2 for some i - 1, • • • ,k. 

They yield zero when partially traced with any combination of local CPTP maps M ' ~■ ,M ' ^, since a CPTP map M ' ^ does 
not contain terms of type q (which is necessary to get a non-trivial partial trace with the term in question). The second type of 
terms are those that do not have any nontrivial cr operator on any of the systems q and 12 , 1 - 1 , • • • ,k, and hence, when partially 
traced with any combination of local CPTP maps M*’ • • • , only the 1 components of those CPTP maps contribute to 

the result, which by definition yields the right-hand side of Eq. (S16). Therefore, if contains only the types of terms 

stated in Proposition III.2. 

To prove the reverse, assume that contains at least one term whose restriction onto 1 112 • ■ ■ kik 2 is not a valid term for 

a process matrix for {1, • • • ,k}. Every such term has the form ®cr“ 2 ®- ■ ■® 6 >“i"®cr“ 2 '® ®. •.® ]l'>'‘" 2 ig)g(*:+i)i(*+i )2 -nin 2 ^ 

where a‘, i - 1 , • • • ,k, are different numbers from \ io k, \ < m < k, is either the identity or some nontrivial cr operator 
on a'j, cr “2 is a nontrivial cr operator on cr^, and g<^+')i(*:+') 2 - nin 2 ^ non-zero operator on (k + l)i(k + 1)2 ■■ ■ nin 2 , which is 
proportional to a tensor product of nontrivial cr operators and 1 on the different subsystems, such that the whole term is an 
allowed term for a process matrix. We want to show that if such a term is present in the process matrix, Eq. (S16) can be 
violated for a specific choice of the local CPTP maps Out of all such terms, consider one for which m has 

the smallest value (there may be more than one of these). Consider the following choice of local CPTP maps constructed based 


on this term: for i - m + I, - ■ ■ ,k, choose M 


= ^]l“'i“ 2 , and for j - - ■ ,m, choose = j^-(]l“i “2 -h ® cr“' 2 ), 


where e^j > 0 is such that (]l"i “2 + e^jO"' ® cr“' 2 ) > 0 (this can always be ensured for sufficiently small non-zero e^j)- Consider 

_ 11 12 _ Ic 11^2 

the Hilbert-Schmidt expansion of the tensor product M ® ■ ■ ■ ® M . From this expansion, only the identity term and the 
term proportional to ® cr“2 ® ® 6)“"' (® cr"” will survive when we plug M ' ^ ® ■ ■ ■ ® M ' Mn the expression on the left-hand 

side of Eq. (S16). This is because in order for any other term to survive, it would be necessary that VT '‘'2 '"i "2 contains a term of 
a form similar to ® cr "2 (® • ■ ■ ® ( 9 ““ ® cr "'2 ® £""”‘“ 2 ”' ® . • • (g ]l "*"2 (g q(<^+ 1 )i(<:+ 1)2 -«i «2 ^ smaller value of m than the 

one we have chosen, which contradicts the assumption that we have chosen the smallest value. Plugging M ' ^ ® ■ ■ ■ ® M ' ^ in 
the expression on the left-hand side of Eq. (S16) therefore yields + ^g(*:+i)i(<:+i )2 -nin 2 for some e > 0, which is 

different from the right-hand side of Eq. (S16). This completes the proof of Proposition III.2. 


Proof S 6 . Proposition III.3. The fact that this form is sufficient for the process matrix to be ECS is obvious because if this 
is true for each of the individual terms, any extension ® ® ® 

pA\B\c\ (gp'^i^'iC is also causally separable. The fact that it is necessary can be seen as follows. Let us choose 

Yvjrich is a tensor product of three bipartite maximally entangled states of the type used in the ‘teleportation’ argument, 
one shared between Alice and Bob, the other one between Alice and Charlie, and the third one between Bob and Charlie. For 
this particular ancilla, it must be possible to write the extended process in the form 


\y^^i^'^2BlB2CiC2 ig, pA\B\C[ 


q^^MA2B,B.CiC2 ^ pA\B\C\ 




^^^a,A 2B,B2C,C2 ®pA;B',c; 


(S17) 


where ^^i'‘' 2 BiB 2 CiC 2 ^pA\B\c\ causally separable and compatible with {A,B)iC, ^^ 1 '‘ 12 BiB 2 CiC 2 ^pA\B^c\ causally separable 
and compatible with (A, C)iB, and vp^i'‘' 2 BiB 2 CiC 2 ^pA[B[c\ causally separable and compatible with {B, C)iA. (This is because 
the state p'^'i^'i‘'5 is pure.) But for each of these terms, we can perform the ‘teleportation’ argument exploiting the respective 
maximally entangled bipartite state contained in proving that ^^i'‘^ 2 BiB 2 CiC 2 form we obtained for 

^AiA 2 B,B 2 C,C 2 obtained for and iy^i'‘‘ 2 BiB 2 C,C 2 obtained for This 

i ecs;{A,C)SB 3 ecs-,{B,C)SA 

completes the proof. 


